inteSearch: An Intelligent Linked Data Information Access Framework
Md-Mizanur Rahoman, Ryutaro Ichise 
November 11, 2014
Introduction Proposed Retrieval Framework: inteSearch Experiment Conclusion 
Outline 
Introduction 
Background of Linked Data Information Access 
Problem and Probable Solution 
Proposed Retrieval Framework: inteSearch 
Pre-processing of Linked Data 
Framework Details 
Experiment 
Conclusion 
Linked Data (LD)
are structured data
represent knowledge with tuples of the form
<Subject, Predicate, Object>,
which are called RDF triples
can be represented as a graph
can be queried with an expressive, SQL-like query language
are stored as openly available data:
2122 datasets, 61 billion RDF triples (as of Apr. 2014)
[Figure: example LD graph. Schema/Ontology layer: classes :Country and :Person (type Class, labels "Country" and "Person"); properties :birthPlace, :supervisor and :spouse (type Property, labels "Birth Place", "Supervisor" and "Spouse"), each with a domain and a range. Instances layer: :amnd, :barl, :clra and :dnld (labels "Amanda", "Berlusconi", "Cleyra" and "Donald") and :grmn, :uk and :grce (labels "Germany", "United Kingdom" and "Greece"), connected by :spouse, :supervisor and :birthPlace edges.]
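The triple representation above can be sketched in a few lines of Python. This is a minimal illustration, assuming plain tuples stand in for RDF triples; the resource names come from the example graph, but the concrete edges (for instance, that :dnld was born in :grce) and the helper name `outgoing` are assumptions made for the example.

```python
# RDF triples modelled as (subject, predicate, object) tuples.
# Resource names follow the example graph; the concrete edges are assumed.
triples = {
    (":Person", "type", "Class"),
    (":Country", "type", "Class"),
    (":Country", "label", "Country"),
    (":birthPlace", "type", "Property"),
    (":birthPlace", "domain", ":Person"),
    (":birthPlace", "range", ":Country"),
    (":dnld", "type", ":Person"),
    (":dnld", "label", "Donald"),
    (":grce", "type", ":Country"),
    (":grce", "label", "Greece"),
    (":dnld", ":birthPlace", ":grce"),  # assumed edge
}


def outgoing(subject):
    """Return the sorted (predicate, object) pairs of a subject, i.e. its outgoing edges."""
    return sorted((p, o) for s, p, o in triples if s == subject)


print(outgoing(":dnld"))
```

Reading each triple as a directed edge from subject to object, labelled by the predicate, gives the graph view of the same data.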
Information Access over LD
It requires
sub-graph finding over the LD graph
this imposes substantial execution cost as the graph gets bigger
know-how of the (dataset-specific) vocabulary, schema, and LD query language
(i.e., linked data semantics)
this demands domain-level expertise
users expect an automated tool to
understand linked data semantics
[Figure: the same example LD graph (Schema/Ontology and Instances layers), with one sub-graph highlighted: :dnld (label "Donald"), :grce (label "Greece"), and the edges :spouse and :birthPlace.]
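To make "sub-graph finding" concrete, here is a deliberately naive sketch of matching a set of triple patterns against a triple list. It is not the authors' method and not how a SPARQL engine works internally; the function name `match`, the `?`-prefixed variable convention, and the sample edges are assumptions made for illustration.

```python
def match(patterns, triples, binding=None):
    """Yield every variable binding under which all triple patterns occur in `triples`."""
    binding = binding or {}
    if not patterns:
        yield dict(binding)
        return
    first, rest = patterns[0], patterns[1:]
    for triple in triples:
        new = dict(binding)
        ok = True
        for term, value in zip(first, triple):
            if term.startswith("?"):
                # A variable must keep the value it was bound to earlier.
                if new.setdefault(term, value) != value:
                    ok = False
                    break
            elif term != value:
                # A constant must match the triple exactly.
                ok = False
                break
        if ok:
            yield from match(rest, triples, new)


# Assumed sample edges over the resources of the example graph.
triples = [
    (":amnd", ":spouse", ":dnld"),
    (":dnld", ":birthPlace", ":grce"),
    (":clra", ":birthPlace", ":uk"),
    (":grce", "label", "Greece"),
    (":uk", "label", "United Kingdom"),
]

# "Whose spouse was born in Greece?" expressed as a three-edge sub-graph.
query = [
    ("?x", ":spouse", "?y"),
    ("?y", ":birthPlace", "?c"),
    ("?c", "label", "Greece"),
]

results = list(match(query, triples))
print(results)
```

Each pattern scans the whole triple list, so the work grows quickly with graph size, and writing the query at all requires knowing names such as `:spouse` and `:birthPlace`; these are the two costs the slide points out.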
Contemporary LD Information Access Systems
Language-Tool-Based-Systems (PowerAqua'06, TBSL'12,
FREyA'11, SemSek'12, CASIA'13, etc.)
use language tools (e.g., parser, POS tagger) to predict possible
sub-graphs (over the LD graph)
convert the sub-graphs to find the SPARQL query
Pivot-Point-Based-Systems (Treo'11, NLP-Reduce'07, etc.)
pick a query word (i.e., the pivot point), then pick the other query words
with respect to the pivot point and predict a possible sub-graph (over the LD graph)
convert the sub-graph to find the SPARQL query
Language-Tool-Based-Systems
Problem
generate many improper parse trees - different parsers give different
parse trees, with different parsing tags
tag with improper semantics (e.g., mis-tagging of query words, such as
whether the query word "spouse" should be tagged as Object or
Predicate)
generate an empty or improper result - by choosing an incorrect sub-graph
Pivot-Point-Based-Systems
Problem
depend heavily upon picking the correct pivot point - in most cases,
systems pick NE (named entity) related pivot points first, then other
pivot points
impose huge cost if the pivot point needs to change - one pivot point can
have multiple LD resources
miss contextual information attachment, e.g., randomly choosing pivot
points could generate very different results
Problem Statement & Probable Solution
Problem Statement
For LD information access, how can we find the required sub-graph
(over the LD graph) at minimum execution cost, such that it
will not generate an empty result
will not miss the contextual information of the query
Solution
To find the correct sub-graph - check the maximum possible sub-graph
generation possibilities
To achieve minimum execution cost - prepare pre-processed LD statistics
that give insight into sub-graph generation possibilities
To not lose the contextual information of the query - adopt a sub-graph
joining technique called the Progressive Joining Approach (Rahoman &
Ichise'14)
inteSearch - Overview
Pre-processed data statistics
store LD resources in a way that lets them be picked easily
store patterns of LD resources so that they give insight into
possible sub-graphs
Development of framework
generate a graph for each single query word (called a Basic Graph)
merge all Basic Graphs to predict all possible sub-graphs (called
Keyword Graphs)
rank all possible Keyword Graphs using the pre-processed data statistics
generate a SPARQL query for the best-ranked Keyword Graphs
Pre-processed data statistics
Label Extractor - extracts and stores the labels of an LD resource
$lv(r) = \{\, o \mid \exists \langle r, p, o \rangle \in \text{RDF triples of dataset} \wedge p \in rrp \,\}$
where $rrp$ is the set of resource-representing Predicates, e.g., label, title, etc.
Pattern-wise Resource Frequency Generator - computes and stores
LD resource pattern frequencies
$sf(r) = |\{\, \langle r, p, o \rangle \mid \exists \langle r, p, o \rangle \in \text{RDF triples of dataset} \,\}|$
$pf(r) = |\{\, \langle s, r, o \rangle \mid \exists \langle s, r, o \rangle \in \text{RDF triples of dataset} \,\}|$
$of(r) = |\{\, \langle s, p, r \rangle \mid \exists \langle s, p, r \rangle \in \text{RDF triples of dataset} \,\}|$
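The four statistics can be computed in a single pass over the triples. The following is a sketch under stated assumptions: triples are plain Python tuples, the set `RRP` contains only the two predicates the slide names as examples, and the function name `preprocess` and the sample triples are hypothetical.

```python
from collections import Counter, defaultdict

# Resource-representing predicates; only the slide's two examples are listed.
RRP = {"label", "title"}


def preprocess(triples):
    """Compute lv (labels) and the subject/predicate/object frequencies sf, pf, of."""
    lv = defaultdict(set)
    sf, pf, of = Counter(), Counter(), Counter()
    for s, p, o in set(triples):  # the definitions count distinct triples
        sf[s] += 1  # r occurs as Subject
        pf[p] += 1  # r occurs as Predicate
        of[o] += 1  # r occurs as Object
        if p in RRP:
            lv[s].add(o)
    return lv, sf, pf, of


# Assumed sample triples around :Country.
triples = [
    (":Country", "label", "Country"),
    (":Country", "type", "Class"),
    (":birthPlace", "range", ":Country"),
    (":grce", "type", ":Country"),
    (":grce", "label", "Greece"),
    (":dnld", ":birthPlace", ":grce"),
]

lv, sf, pf, of = preprocess(triples)
print(lv[":Country"], sf[":Country"], pf[":Country"], of[":Country"])
```

Because the statistics are computed once and stored, later stages can look up how a resource is typically used without touching the graph again.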
Example of Pre-processed Data Statistics 
Exemplary LD graph
[Figure: the example LD graph (Schema/Ontology and Instances layers), with the triples whose Subject is :Country highlighted: :Country label "Country" and :Country type Class.]
Pre-processed data statistics
r          lv(r)     sf(r)   pf(r)   of(r)
:Country   Country   2       ...     ...
:...       ...       ...     ...     ...
Development of Framework 
Basic Graph Generator - generates the Basic Graphs
Keyword Graph Generator - merges all Basic Graphs to predict the
Keyword Graphs
Ranker - ranks all possible Keyword Graphs using the pre-processed data
statistics
SPARQL Query Generator - generates a SPARQL query for the best-ranked
Keyword Graphs
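The last component is the simplest to illustrate. The slides do not show how the query is generated, so the following is only a plausible sketch: a Keyword Graph is assumed to be a list of triple patterns, the function name `to_sparql` is hypothetical, and the base IRI `http://example.org/` bound to the `:` prefix is a placeholder.

```python
def to_sparql(keyword_graph, prefix_iri="http://example.org/"):
    """Render a list of (s, p, o) triple patterns as a SPARQL SELECT query string."""
    variables = []
    for triple in keyword_graph:
        for term in triple:
            if term.startswith("?") and term not in variables:
                variables.append(term)  # keep first-seen order
    if not variables:
        raise ValueError("a SELECT query needs at least one variable")
    body = "\n".join(f"  {s} {p} {o} ." for s, p, o in keyword_graph)
    return (
        f"PREFIX : <{prefix_iri}>\n"
        f"SELECT DISTINCT {' '.join(variables)}\n"
        f"WHERE {{\n{body}\n}}"
    )


query = to_sparql([("?s1", ":spouse", "?o1"), ("?o1", ":birthPlace", ":grce")])
print(query)
```

In a full pipeline the placeholders for query words would first be replaced by the LD resources they were matched to; that step is left out here.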
Basic Graph Generator
Choose one of the three Basic Graphs for each query word k
<k, ?p, ?o>, or <?s, k, ?o>, or <?s, ?p, k>
decided by the LD resources that are similar to the query word
and their pattern frequencies
e.g.,
if, for the similar LD resources {R}, the
Predicate Pattern-wise Resource Frequency of an LD resource (e.g.,
pf(ri)) is bigger than all Subject and Object Pattern-wise Resource
Frequencies, then we select the Basic Graph <?s, k, ?o>
the weight is computed from the highest pattern frequency of the LD resources {R}
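The selection rule can be sketched as a lookup over the pre-processed frequencies. This is an illustration, not the authors' implementation: the function name `basic_graph`, the variable naming scheme (`?s1`, `?p1`, `?o1`), and the tie-breaking rule (the first position to reach the maximum wins) are assumptions.

```python
def basic_graph(keyword_id, resources, sf, pf, of):
    """Pick the Basic Graph for one query word.

    resources: LD resources judged similar to the query word ({R} on the slide).
    sf, pf, of: mappings from resource to Subject/Predicate/Object frequency.
    Returns (triple_pattern, weight).
    """
    if not resources:
        raise ValueError("no similar LD resources for this query word")
    best_position, weight = None, -1
    for r in resources:
        for position, freq in (
            ("subject", sf.get(r, 0)),
            ("predicate", pf.get(r, 0)),
            ("object", of.get(r, 0)),
        ):
            if freq > weight:  # ties keep the earlier candidate (assumption)
                best_position, weight = position, freq
    k = f"k{keyword_id}"
    s, p, o = f"?s{keyword_id}", f"?p{keyword_id}", f"?o{keyword_id}"
    pattern = {
        "subject": (k, p, o),    # query word used as Subject
        "predicate": (s, k, o),  # query word used as Predicate
        "object": (s, p, k),     # query word used as Object
    }[best_position]
    return pattern, weight


# Hypothetical frequencies: :spouse is used mostly as a Predicate.
print(basic_graph(1, [":spouse"], {":spouse": 3}, {":spouse": 40}, {}))
```

The returned weight is the highest pattern frequency found, which is the quantity the ranking step later reuses.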
Keyword Graph Generator
Merge all Basic Graphs in all their possible merging options by
following the Progressive Joining Approach
e.g., merging the 1st and 2nd Basic Graphs at all possible options
[Figure: the 1st Basic Graph (for k1), the 2nd Basic Graph (for k2), and the Keyword Graphs produced by each possible merge of the two.]
Progressive Joining Approach - if the query words are in the order
{k1, k2, k3, ..., km}, then
join the Basic Graph of k1 and the Basic Graph of k2 to find an
Intermediate-version Keyword Graph, then
progressively join the Basic Graph of each remaining query word and
update the Intermediate-version Keyword Graph, until no query word remains
The Progressive Joining Approach maintains contextual information
attachment
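The joining loop can be sketched as follows. The slides do not spell out which merges count as possible, so this sketch makes an explicit assumption: a join renames one variable node (Subject or Object) of the next Basic Graph to a node of the last joined Basic Graph. The function names are hypothetical, and the published approach (Rahoman & Ichise'14) may define the options differently.

```python
def nodes(pattern):
    """Subject and Object of a triple pattern (the Predicate is an edge, not a join point)."""
    s, _, o = pattern
    return [s, o]


def join_options(last_bg, next_bg):
    """All ways to attach next_bg to last_bg by renaming one variable node of next_bg."""
    options = []
    for var in nodes(next_bg):
        if not var.startswith("?"):
            continue  # a query word itself is never renamed
        for target in nodes(last_bg):
            options.append(tuple(target if t == var else t for t in next_bg))
    return options


def progressive_join(basic_graphs):
    """Join Basic Graphs in query-word order; return every resulting Keyword Graph."""
    if not basic_graphs:
        return []
    # Each entry: (Keyword Graph so far, last joined Basic Graph).
    intermediates = [((basic_graphs[0],), basic_graphs[0])]
    for next_bg in basic_graphs[1:]:
        extended = []
        for graph, last_bg in intermediates:
            for joined in join_options(last_bg, next_bg):
                extended.append((graph + (joined,), joined))
        intermediates = extended
    return [graph for graph, _ in intermediates]


bg1 = ("?s1", "k1", "?o1")  # k1 used as Predicate
bg2 = ("?s2", "?p2", "k2")  # k2 used as Object
bg3 = ("?s3", "?p3", "k3")  # k3 used as Object
for graph in progressive_join([bg1, bg2]):
    print(graph)
```

Because each new Basic Graph is attached only to the one joined just before it, adjacent query words stay connected in the result, which is one way to read "maintains contextual information attachment".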
Progressive Joining Approach - an Example
[Figure: an Intermediate-version Keyword Graph built from k1 and k2, and the Basic Graph corresponding to the next query word k3.]
all possible contextually-feasible Keyword Graphs
[Table of graph drawings; columns: Intermediate-version KG | Next BG | Joining between last joined BG and next BG | Increase of KG]
Ranker
Rank Keyword Graphs by
Weight - the minimum weight of the constituent Basic Graphs
Depth level - how many edges a Keyword Graph holds
Keyword Graphs with a lower depth level are ranked higher than
Keyword Graphs with a higher depth level
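The two criteria can be combined into a sort key. How the slide intends to combine them is not stated, so this sketch assumes depth level is compared first (fewer edges rank higher) and weight breaks ties (higher minimum weight ranks higher); the function name `rank` and the sample weights are hypothetical.

```python
def rank(keyword_graphs):
    """Sort Keyword Graphs, best first.

    keyword_graphs: list of (patterns, basic_graph_weights) pairs, where
    patterns is a sequence of triple patterns (one per edge) and
    basic_graph_weights holds the weight of each constituent Basic Graph.
    """
    def key(item):
        patterns, weights = item
        depth = len(patterns)   # depth level: number of edges
        weight = min(weights)   # weight: minimum constituent weight
        return (depth, -weight)  # assumed order: depth first, then weight
    return sorted(keyword_graphs, key=key)


t1 = ("?s1", "k1", "?o1")
t2 = ("?o1", "?p2", "k2")
a = ((t1, t2), (40, 5))   # depth 2, weight 5
b = ((t1, t2), (30, 12))  # depth 2, weight 12
c = ((t1,), (3,))         # depth 1, weight 3
ranked = rank([a, b, c])
print([min(w) for _, w in ranked])
```

Taking the minimum means a Keyword Graph is only as strong as its weakest Basic Graph, so a single poorly supported query word lowers the whole graph's rank.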
Md-Mizanur Rahoman, Ryutaro Ichise j 19

More Related Content

What's hot

Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sources
IJwest
 
ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.
Tatiana Tarasova
 
Classification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF MetricClassification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF Metric
Marie Vans
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search engines
Boris Galitsky
 
SAC 2019 ester giallonardo
SAC 2019 ester giallonardoSAC 2019 ester giallonardo
SAC 2019 ester giallonardo
Ester Giallonardo
 
Python and R for quantitative finance
Python and R for quantitative financePython and R for quantitative finance
Python and R for quantitative finance
Luca Sbardella
 
Elements of Text Mining Part - I
Elements of Text Mining Part - IElements of Text Mining Part - I
Elements of Text Mining Part - I
Jaganadh Gopinadhan
 
Ir 03
Ir   03Ir   03
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevImage Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Databricks
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
WU (Vienna University of Economics and Business)
 
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter DataText Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
Yanchang Zhao
 
Text Mining with R
Text Mining with RText Mining with R
Text Mining with R
Sanjay Mishra
 
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Ciro Sorrentino
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco Control
Ben Healey
 
Ir 02
Ir   02Ir   02
Hybrid geo textual index structure
Hybrid geo textual index structureHybrid geo textual index structure
Hybrid geo textual index structure
cseij
 

What's hot (20)

Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sources
 
master_thesis_greciano_v2
master_thesis_greciano_v2master_thesis_greciano_v2
master_thesis_greciano_v2
 
Working with text data
Working with text dataWorking with text data
Working with text data
 
ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.
 
Classification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF MetricClassification of CNN.com Articles using a TF*IDF Metric
Classification of CNN.com Articles using a TF*IDF Metric
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search engines
 
SAC 2019 ester giallonardo
SAC 2019 ester giallonardoSAC 2019 ester giallonardo
SAC 2019 ester giallonardo
 
Python and R for quantitative finance
Python and R for quantitative financePython and R for quantitative finance
Python and R for quantitative finance
 
Elements of Text Mining Part - I
Elements of Text Mining Part - IElements of Text Mining Part - I
Elements of Text Mining Part - I
 
Ir 03
Ir   03Ir   03
Ir 03
 
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey GusevImage Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
Image Similarity Detection at Scale Using LSH and Tensorflow with Andrey Gusev
 
semanticweb
semanticwebsemanticweb
semanticweb
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
07 04-06
07 04-0607 04-06
07 04-06
 
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter DataText Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
 
Text Mining with R
Text Mining with RText Mining with R
Text Mining with R
 
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
Topic-based Federator Query Engine - Presented at ICWI Budapest 2018
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco Control
 
Ir 02
Ir   02Ir   02
Ir 02
 
Hybrid geo textual index structure
Hybrid geo textual index structureHybrid geo textual index structure
Hybrid geo textual index structure
 

Viewers also liked

TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval FrameworkTLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
National Inistitute of Informatics (NII), Tokyo, Japann
 
Inclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
Inclusion of Temporal Semantics over Keyword-based Linked Data RetrievalInclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
Inclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
National Inistitute of Informatics (NII), Tokyo, Japann
 
BoTLRet: A Template-based Linked Data Information Retrieval
 BoTLRet: A Template-based Linked Data Information Retrieval BoTLRet: A Template-based Linked Data Information Retrieval
BoTLRet: A Template-based Linked Data Information Retrieval
National Inistitute of Informatics (NII), Tokyo, Japann
 
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
Tim Wade
 
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw..."Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
Tim Wade
 
Prueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRICPrueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRICGentedigital
 
세오갤러리
세오갤러리세오갤러리
세오갤러리redcoky
 
An automated template selection framework for keyword query over linked data
An automated template selection framework for keyword query over linked dataAn automated template selection framework for keyword query over linked data
An automated template selection framework for keyword query over linked data
National Inistitute of Informatics (NII), Tokyo, Japann
 
Client Case Studies
Client Case StudiesClient Case Studies
Client Case Studies
sjpitts00
 
The formula for effective facebook ads
The formula for effective facebook adsThe formula for effective facebook ads
The formula for effective facebook adsguest6aff87b
 
Tribus urbanas
Tribus urbanasTribus urbanas
Tribus urbanas
saritaguimar
 
LiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word FinderLiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word Finder
National Inistitute of Informatics (NII), Tokyo, Japann
 
2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!
Tim Wade
 
How to increase Productivity - Tim Wade
How to increase Productivity - Tim WadeHow to increase Productivity - Tim Wade
How to increase Productivity - Tim WadeTim Wade
 

Viewers also liked (18)

TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval FrameworkTLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
TLDRet: A Temporal Semantic Facilitated Linked Data Retrieval Framework
 
Inclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
Inclusion of Temporal Semantics over Keyword-based Linked Data RetrievalInclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
Inclusion of Temporal Semantics over Keyword-based Linked Data Retrieval
 
BoTLRet: A Template-based Linked Data Information Retrieval
 BoTLRet: A Template-based Linked Data Information Retrieval BoTLRet: A Template-based Linked Data Information Retrieval
BoTLRet: A Template-based Linked Data Information Retrieval
 
TESTING PT 1
TESTING PT 1TESTING PT 1
TESTING PT 1
 
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
From Fear To Victory! - Tim Wade at the Smart Investment & Property Investor ...
 
Savannah, ga 05
Savannah, ga 05Savannah, ga 05
Savannah, ga 05
 
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw..."Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
"Be a Productivity Powerhouse in 2011" - Tim Wade (ST701 2 Dec 2010) www.timw...
 
Prueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRICPrueba 1 en PWPAPP CRIC
Prueba 1 en PWPAPP CRIC
 
세오갤러리
세오갤러리세오갤러리
세오갤러리
 
An automated template selection framework for keyword query over linked data
An automated template selection framework for keyword query over linked dataAn automated template selection framework for keyword query over linked data
An automated template selection framework for keyword query over linked data
 
Client Case Studies
Client Case StudiesClient Case Studies
Client Case Studies
 
The formula for effective facebook ads
The formula for effective facebook adsThe formula for effective facebook ads
The formula for effective facebook ads
 
Tribus urbanas
Tribus urbanasTribus urbanas
Tribus urbanas
 
A Sacrifice
A SacrificeA Sacrifice
A Sacrifice
 
LiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word FinderLiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word Finder
 
2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!2012 Tim Wade slides - Leading Change? Yes We Can!
2012 Tim Wade slides - Leading Change? Yes We Can!
 
Initiation
InitiationInitiation
Initiation
 
How to increase Productivity - Tim Wade
How to increase Productivity - Tim WadeHow to increase Productivity - Tim Wade
How to increase Productivity - Tim Wade
 

Similar to inteSearch: An Intelligent Linked Data Information Access Framework

Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
Nikolaos Konstantinou
 
Recommender Systems in the Linked Data era
Recommender Systems in the Linked Data eraRecommender Systems in the Linked Data era
Recommender Systems in the Linked Data era
Roku
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 Tutorial
Muhammad Saleem
 
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
Databricks
 
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
Spark Summit
 
Efficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federationEfficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federation
Muhammad Saleem
 
Big data analysis using spark r published
Big data analysis using spark r publishedBig data analysis using spark r published
Big data analysis using spark r published
Dipendra Kusi
 
morph-LDP Demo
morph-LDP Demomorph-LDP Demo
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and REnabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Databricks
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics Environment
Ian Foster
 
Introduction to data analysis using R
Introduction to data analysis using RIntroduction to data analysis using R
Introduction to data analysis using R
Victoria López
 
Personalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebPersonalised Search for the Social Semantic Web
Personalised Search for the Social Semantic Web
Oana Tifrea-Marciuska
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
Jun Zhao
 
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talkDistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
Gezim Sejdiu
 
RSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF StreamsRSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF Streams
keski
 
SparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at ScaleSparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at Scale
jeykottalam
 
Machine Learning with SparkR
Machine Learning with SparkRMachine Learning with SparkR
Machine Learning with SparkR
Olgun Aydın
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
Impetus Technologies
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
Vrije Universiteit Amsterdam
 
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
MLconf
 

Similar to inteSearch: An Intelligent Linked Data Information Access Framework (20)

Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
Recommender Systems in the Linked Data era
Recommender Systems in the Linked Data eraRecommender Systems in the Linked Data era
Recommender Systems in the Linked Data era
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 Tutorial
 
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
From DataFrames to Tungsten: A Peek into Spark's Future @ Spark Summit San Fr...
 
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
From DataFrames to Tungsten: A Peek into Spark's Future-(Reynold Xin, Databri...
 
Efficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federationEfficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federation
 
Big data analysis using spark r published
Big data analysis using spark r publishedBig data analysis using spark r published
Big data analysis using spark r published
 
morph-LDP Demo
morph-LDP Demomorph-LDP Demo
morph-LDP Demo
 
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and REnabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and R
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics Environment
 
Introduction to data analysis using R
Introduction to data analysis using RIntroduction to data analysis using R
Introduction to data analysis using R
 
Personalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebPersonalised Search for the Social Semantic Web
Personalised Search for the Social Semantic Web
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talkDistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
 
RSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF StreamsRSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF Streams
 
SparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at ScaleSparkR: Enabling Interactive Data Science at Scale
SparkR: Enabling Interactive Data Science at Scale
 
Machine Learning with SparkR
Machine Learning with SparkRMachine Learning with SparkR
Machine Learning with SparkR
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
 

inteSearch: An Intelligent Linked Data Information Access Framework

  • 1. inteSearch: An Intelligent Linked Data Information Access Framework Md-Mizanur Rahoman, Ryutaro Ichise November 11, 2014
  • 2. Outline
      - Introduction
          - Background of Linked Data Information Access
          - Problem and Probable Solution
      - Proposed Retrieval Framework: inteSearch
          - Pre-processing of Linked Data
          - Framework Details
      - Experiment
      - Conclusion
  • 3. Linked Data (LD)
      - are structured data
      - represent knowledge with tuples of the form <Subject, Predicate, Object>, called RDF triples
      - can be represented as a graph
      - can be queried with an expressive, SQL-like query language
      - are stored as openly available data: 2122 datasets, 61 billion RDF triples (as of Apr. 2014)
      [Figure: example LD graph. Schema/Ontology: classes :Country and :Person; properties :birthPlace, :supervisor, :spouse, each with label, domain, and range. Instances: :amnd (Amanda), :barl (Berlusconi), :clra (Cleyra), :dnld (Donald), :grmn (Germany), :uk (United Kingdom), :grce (Greece), connected by :spouse, :supervisor, and :birthPlace edges.]
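To make the triple model concrete, here is a minimal Python sketch that stores a few RDF triples as plain tuples and matches them against a pattern with wildcards. The toy triples are hypothetical: the resource names come from the slide's figure, but the exact edges (for example, that :dnld has :birthPlace :grce) are an assumption, and `match` is an illustrative helper, not part of inteSearch.

```python
from typing import Optional

Triple = tuple[str, str, str]

# Hypothetical toy data; the edges are assumed for illustration only.
TRIPLES: list[Triple] = [
    (":Person", "label", "Person"),
    (":Country", "label", "Country"),
    (":dnld", "type", ":Person"),
    (":dnld", "label", "Donald"),
    (":grce", "type", ":Country"),
    (":grce", "label", "Greece"),
    (":dnld", ":birthPlace", ":grce"),
]


def match(triples: list[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> list[Triple]:
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# All triples whose predicate is :birthPlace.
birth_places = match(TRIPLES, p=":birthPlace")
```

A pattern with several wildcards is the simplest case of sub-graph finding; real LD stores answer such patterns through SPARQL rather than a linear scan.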
  • 4. Information Access over LD
      It requires
      - sub-graph finding over the LD graph, which imposes substantial execution cost as the graph gets bigger
      - know-how of the (dataset-specific) vocabulary, schema, and LD query language (i.e., linked data semantics), which demands domain-level expertise
      - hence, an automated tool that understands linked data semantics is expected
      [Figure: the example LD graph (Schema/Ontology and Instances), together with an example sub-graph over :spouse, :dnld (Donald), :birthPlace, and :grce (Greece).]
  • 7. Contemporary LD Information Access Systems
      Language-Tool-Based-Systems (PowerAqua'06, TBSL'12, FREyA'11, SemSek'12, CASIA'13, etc.)
      - use language tools (e.g., parser, POS tagger) to predict possible sub-graphs over the LD graph
      - convert the sub-graphs into a SPARQL query
      Pivot-Point-Based-Systems (Treo'11, NLP-Reduce'07, etc.)
      - pick a query word (the pivot point), then pick the other query words w.r.t. the pivot point, and predict a possible sub-graph over the LD graph
      - convert the sub-graph into a SPARQL query
  • 10. Language-Tool-Based-Systems Problem
      - generate many improper parse trees: different parsers give different parse trees, with different parsing tags
      - tag with improper semantics (e.g., mis-tagging of query words, such as whether the query word "spouse" should be tagged as Object or Predicate)
      - generate empty or improper results by choosing an incorrect sub-graph
  • 11. Pivot-Point-Based-Systems Problem
      - depend heavily on picking the correct pivot point: in most cases, systems pick named-entity (NE) related pivot points first, then other pivot points
      - impose a huge cost if the pivot point needs to change: one pivot point can have multiple LD resources
      - miss contextual information attachment, e.g., random choice of pivot points could generate very different results
  • 13. Problem Statement and Probable Solution
      Problem Statement
      For LD information access, how can we find the required sub-graph (over the LD graph)
      - with minimum execution cost
      - that will not generate an empty result
      - that will not miss the contextual information of the query
      Solution
      - To find the correct sub-graph: check the maximum possible sub-graph generation possibility
      - To achieve minimum execution cost: prepare pre-processed LD statistics that give insight into the sub-graph generation possibility
      - To not lose the contextual information of the query: adapt a sub-graph joining technique called the Progressive Joining Approach (Rahoman & Ichise'14)
  • 16. inteSearch - Overview
      Pre-processed data statistics
      - store LD resources so that they can be picked easily
      - store patterns of LD resources so that they give insight into possible sub-graphs
      Development of framework
      - generate a graph for each single query word (called a Basic Graph)
      - merge all Basic Graphs to predict all possible sub-graphs (called Keyword Graphs)
      - rank all possible Keyword Graphs using the pre-processed data statistics
      - generate SPARQL queries for the best-ranked Keyword Graphs
  • 17. Pre-processed data statistics
      Label Extractor: extract and store the labels of an LD resource
        $lv(r) = \{\, o \mid \exists \langle r, p, o \rangle \in \text{RDF triples of dataset} \wedge p \in rrp \,\}$
        where $rrp$ is the set of resource-representing Predicates, e.g., label, title, etc.
      Pattern-wise Resource Frequency Generator: compute and store the LD resource pattern frequencies
        $sf(r) = |\{\, \langle r, p, o \rangle \mid \exists \langle r, p, o \rangle \in \text{RDF triples of dataset} \,\}|$
        $pf(r) = |\{\, \langle s, r, o \rangle \mid \exists \langle s, r, o \rangle \in \text{RDF triples of dataset} \,\}|$
        $of(r) = |\{\, \langle s, p, r \rangle \mid \exists \langle s, p, r \rangle \in \text{RDF triples of dataset} \,\}|$
  • 18. Example of Pre-processed Data Statistics
      [Figure: exemplary LD graph (Schema/Ontology and Instances), with the triples <:Country, label, Country> and <:Country, type, Class> singled out.]
      Pre-processed data statistics
        r        | lv(r)   | sf(r) | pf(r) | of(r)
        :Country | Country | 2     | ...   | ...
        :...     | ...     | ...   | ...   | ...
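The label extractor and the three pattern-wise frequencies can be computed in one pass over the triples. The sketch below is an illustration under stated assumptions: the function name `build_statistics`, the contents of `LABEL_PREDICATES` (standing in for rrp), and the toy triples are all hypothetical. Only the value sf(:Country) = 2 corresponds to the slide's example; the other counts are properties of the toy data.

```python
from collections import Counter, defaultdict

# Stand-in for rrp, the resource-representing predicates (assumed values).
LABEL_PREDICATES = {"label", "title"}


def build_statistics(triples, label_predicates=LABEL_PREDICATES):
    """Compute lv, sf, pf and of for every resource in one pass."""
    lv = defaultdict(set)   # lv(r): labels of resource r
    sf = Counter()          # sf(r): triples with r as Subject
    pf = Counter()          # pf(r): triples with r as Predicate
    of = Counter()          # of(r): triples with r as Object
    # The definitions count sets of triples, so duplicates are dropped first.
    for s, p, o in set(triples):
        sf[s] += 1
        pf[p] += 1
        of[o] += 1
        if p in label_predicates:
            lv[s].add(o)
    return lv, sf, pf, of


# Toy triples (hypothetical, loosely following the example graph).
triples = [
    (":Country", "label", "Country"),
    (":Country", "type", "Class"),
    (":birthPlace", "range", ":Country"),
    (":grce", "type", ":Country"),
]
lv, sf, pf, of = build_statistics(triples)
```

Because a `Counter` returns 0 for unseen keys, looking up a resource that never occurs in a given position needs no special handling.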
  • 19. Development of Framework
      - Basic Graph Generator: generates the Basic Graphs
      - Keyword Graph Generator: merges all Basic Graphs to predict the Keyword Graphs
      - Ranker: ranks all possible Keyword Graphs using the pre-processed data statistics
      - SPARQL Query Generator: generates SPARQL queries for the best-ranked Keyword Graphs
  • 20. Development of Framework
  • 21. Basic Graph Generator
      Choose one of the three Basic Graphs for each query word k
      [Figure: the three Basic Graphs, in which k takes the Subject, Predicate, or Object position and the remaining positions are variables (?s, ?p, ?o).]
      - decided by the LD resources that are similar to the query word, and by their pattern frequencies
        e.g., for the similar LD resources {R}, if the Predicate Pattern-wise Resource Frequency of an LD resource (e.g., pf(r_i)) is bigger than all Subject and Object Pattern-wise Resource Frequencies, then we select the Basic Graph in which k is the Predicate between ?s and ?o
      - weight computed from the highest pattern frequencies of the LD resources {R}
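The selection rule can be sketched as a comparison of the three pattern frequencies over the resources matched to a keyword. This is a reading of the slide, not the authors' implementation: the function name `choose_basic_graph`, the tuple encoding of a Basic Graph, taking the single highest frequency as the weight, and the tie-breaking order (Subject, then Predicate, then Object) are all assumptions. Finding the resources similar to a keyword is outside the sketch, so they are passed in.

```python
def choose_basic_graph(keyword, resources, sf, pf, of):
    """Pick the Basic Graph whose position has the highest pattern frequency.

    resources: LD resources similar to the keyword (found elsewhere).
    sf, pf, of: dicts mapping a resource to its pattern frequencies.
    Returns (basic_graph, weight); the graph is an (s, p, o) pattern.
    """
    best_freq, best_position = -1, None
    for r in resources:
        for position, freq in (
            ("subject", sf.get(r, 0)),
            ("predicate", pf.get(r, 0)),
            ("object", of.get(r, 0)),
        ):
            # Strict comparison keeps the first position seen on ties.
            if freq > best_freq:
                best_freq, best_position = freq, position
    if best_position is None:
        raise ValueError("no similar LD resources for keyword")
    templates = {
        "subject": (keyword, "?p", "?o"),
        "predicate": ("?s", keyword, "?o"),
        "object": ("?s", "?p", keyword),
    }
    return templates[best_position], best_freq


# Hypothetical frequencies: :spouse occurs mostly as a Predicate.
graph, weight = choose_basic_graph(
    "spouse", [":spouse"], sf={":spouse": 1}, pf={":spouse": 3}, of={}
)
```

With these toy frequencies the keyword lands in the Predicate position, which matches the example on the slide.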
  • 22. Development of Framework
  • 23. Keyword Graph Generator
      Merge all Basic Graphs in all their possible merging options by following the Progressive Joining Approach
      e.g., merging the 1st and 2nd Basic Graphs at all possible options
      [Figure: the 1st Basic Graph (for k1), the 2nd Basic Graph (for k2), and the Keyword Graphs obtained from their possible merging options.]
      Progressive Joining Approach: for query words in the order {k1, k2, k3, ..., km}, join the Basic Graph of k1 and the Basic Graph of k2 to find an Intermediate-version Keyword Graph, then progressively join the next Basic Graph for the remaining query words and update the Intermediate-version Keyword Graph, until no query word remains
      The Progressive Joining Approach maintains contextual information attachment
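The progressive structure (join k1 with k2, then keep extending the intermediate result with the next Basic Graph) can be sketched as a fold over the keyword order. The joining rule used here is a hypothetical stand-in, since the slide's figure does not survive in the transcript: `join_options` unifies one variable node of the next Basic Graph with one variable node of the last joined Basic Graph. Only the overall loop reflects the description above.

```python
def is_var(term):
    """Variables are written with a leading question mark."""
    return term.startswith("?")


def join_options(last, nxt):
    """Assumed joining rule: share a node variable of `last` with `nxt`.

    Both arguments are (s, p, o) patterns. Only the Subject and Object
    positions (indices 0 and 2) are considered as join points.
    """
    options = []
    for i in (0, 2):
        if not is_var(last[i]):
            continue
        for j in (0, 2):
            if not is_var(nxt[j]):
                continue
            joined = list(nxt)
            joined[j] = last[i]   # reuse the variable of the last pattern
            options.append(tuple(joined))
    return options


def progressive_join(basic_graphs):
    """Join Basic Graphs in keyword order, keeping every joining option."""
    keyword_graphs = [[basic_graphs[0]]]      # intermediate Keyword Graphs
    for nxt in basic_graphs[1:]:
        keyword_graphs = [
            kg + [option]
            for kg in keyword_graphs
            for option in join_options(kg[-1], nxt)  # last joined BG only
        ]
    return keyword_graphs


# Three hypothetical Basic Graphs with distinct variable names.
bgs = [("?s1", "k1", "?o1"), ("?s2", "?p2", "k2"), ("k3", "?p3", "?o3")]
kgs = progressive_join(bgs)
```

Joining only against the last joined Basic Graph is what keeps neighbouring query words attached to each other, which is the contextual-information property the slide refers to.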
  • 25. Progressive Joining Approach - an Example
      [Figure: an Intermediate-version Keyword Graph (over k1 and k2) and the Basic Graph of the next query word (k3) are joined into all possible contextually feasible Keyword Graphs. Table columns: Intermediate-version KG; Next BG; Joining between last joined BG and next BG; Increase of KG.]
  • 26. Development of Framework
  • 27. Ranker
      Rank Keyword Graphs by
      - Weight: the minimum weight of the constituent Basic Graphs
      - Depth level: how many edges a Keyword Graph holds
      Keyword Graphs with a lower depth level are ranked higher than Keyword Graphs with a higher depth level
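One way to read these two criteria is as a composite sort key. In the sketch below the depth level comes first (fewer edges rank higher) and the weight breaks ties (larger minimum weight ranks higher); that combination, the name `rank_keyword_graphs`, and the `(patterns, weights)` encoding are assumptions rather than details given on the slide.

```python
def rank_keyword_graphs(keyword_graphs):
    """Sort Keyword Graphs from best to worst.

    Each Keyword Graph is a pair (patterns, weights):
      patterns: list of (s, p, o) edges,
      weights:  weights of its constituent Basic Graphs.
    """
    def sort_key(kg):
        patterns, weights = kg
        depth = len(patterns)      # depth level = number of edges
        weight = min(weights)      # weight = minimum Basic Graph weight
        return (depth, -weight)    # shallow first, then heavier first

    return sorted(keyword_graphs, key=sort_key)


# Hypothetical candidates: two with one edge, one with two edges.
candidates = [
    ([("?s", "k1", "?o"), ("?o", "?p", "k2")], [9, 7]),
    ([("?s", "k1", "k2")], [4, 2]),
    ([("k2", "k1", "?o")], [5, 3]),
]
ranked = rank_keyword_graphs(candidates)
```

Python's `sorted` is stable, so Keyword Graphs that tie on both criteria keep their generation order.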
  • 28. Development of Framework
  • 29. SPARQL Query Generator
      Construct SPARQL queries for the higher-ranked Keyword Graphs until the first non-empty result is obtained
      Keyword Graphs are converted directly by
      - putting the Variables in the SELECT clause
      - merging the keyword-corresponding resources in UNION clauses
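The conversion can be sketched as string construction: variables go into SELECT, and each keyword position expands into one UNION branch per candidate resource. The helper names (`to_sparql`, `first_non_empty`), the `candidates` mapping, and the example.org IRIs are hypothetical; `run_query` stands for whatever client sends the query to the triple store.

```python
def is_var(term):
    """Variables are written with a leading question mark."""
    return term.startswith("?")


def to_sparql(patterns, candidates):
    """Build a SELECT query from (s, p, o) patterns.

    candidates maps each keyword to the IRIs of its matching LD resources.
    """
    variables = []
    for pattern in patterns:
        for term in pattern:
            if is_var(term) and term not in variables:
                variables.append(term)
    groups = []
    for pattern in patterns:
        # Expand every keyword position into its candidate resources.
        alternatives = [()]
        for term in pattern:
            if is_var(term):
                choices = [term]
            else:
                choices = ["<" + iri + ">" for iri in candidates[term]]
            alternatives = [alt + (c,) for alt in alternatives for c in choices]
        blocks = ["{ " + " ".join(alt) + " . }" for alt in alternatives]
        groups.append(" UNION ".join(blocks))
    body = "\n  ".join(groups)
    return "SELECT DISTINCT " + " ".join(variables) + " WHERE {\n  " + body + "\n}"


def first_non_empty(ranked_queries, run_query):
    """Run queries in rank order and stop at the first non-empty result."""
    for query in ranked_queries:
        rows = run_query(query)
        if rows:
            return query, rows
    return None, []


query = to_sparql(
    [("?s", "spouse", "?o")],
    {"spouse": ["http://example.org/spouse", "http://example.org/partner"]},
)
```

For the single pattern above, the generated query selects ?s and ?o and contains two UNION branches, one per candidate IRI.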
  • 31. Experimental Setup
      Question setup
      - Questions: the Question Answering over Linked Data 3 (QALD-3) test question set, which consists of natural language questions
        [Table, as extracted: Dataset, Total Qs; QALD-3, DBpedia, 99, 99]
      - Keywords: constructed manually w.r.t. the word order of the question words
      Evaluation metrics
      - Recall, Precision, F1-Measure
      Evaluated for
      - detailed performance analysis, execution complexity measure, comparison with other systems
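For reference, the three metrics can be computed per question from the gold answers and the returned answers, and then averaged. This is the standard set-based definition; that QALD-3 scores are averaged per question in exactly this way is an assumption here, and the function names are illustrative.

```python
def precision_recall_f1(gold, returned):
    """Set-based precision, recall and F1 for one question."""
    gold, returned = set(gold), set(returned)
    correct = len(gold & returned)
    precision = correct / len(returned) if returned else 0.0
    recall = correct / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def average_scores(per_question):
    """Average (precision, recall, f1) triples over all questions."""
    n = len(per_question)
    if n == 0:
        return 0.0, 0.0, 0.0
    return tuple(sum(score[i] for score in per_question) / n for i in range(3))


# One correct answer out of three returned, two gold answers.
p, r, f1 = precision_recall_f1({"a", "b"}, {"a", "c", "d"})
```

In this toy case precision is 1/3, recall is 1/2, and F1 is their harmonic mean, 0.4.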
  • 32. Detailed performance analysis
      Analyzed by the number of keywords each question holds
                            | No. of Qs | Recall (Avg) | Precision (Avg) | F1-Measure (Avg)
        One Keyword Group   | 1         | 1.00         | 1.00            | 1.00
        Two Keyword Group   | 45        | 0.90         | 0.96            | 0.92
        Three Keyword Group | 13        | 0.77         | 0.77            | 0.77
        Four Keyword Group  | 8         | 0.75         | 0.75            | 0.75
        Five Keyword Group  | 3         | 1.000        | 1.000           | 1.000
        Overall             |           | 0.87         | 0.90            | 0.88
      Observation
      - for One/Two/Three Keyword Group questions, the selection of Basic Graphs works well
      - for more-than-one Keyword Group questions, merging-based Keyword Graph construction and ranking work well
      - pre-processed data statistics help in efficient sub-graph finding over the linked data graph
  • 34. Execution-time-wise performance analysis
      Environment
      - Machine: Intel(R) Core(TM) i7-4770K central processing unit (CPU), 3.50 GHz based system with 16 GB memory
      - Triple Store: network-connected Virtuoso (version 06.01.3127)
        Keyword Group  | One | Two  | Three | Four | Five
        Execution time | 710 | 2441 | 2774  | 3585 | 3720   (ms)
      Observation
      - execution cost increases linearly with the number of keywords
      - pre-processed data statistics support faster execution
  • 35. Performance Comparison
      Compared with QALD-3 challenge participant systems
        System        | # of Questions | Processed | Right | Partially | Recall | Precision | F1-Measure
        squall2sparql | 99             | 99        | 80    | 13        | 0.88   | 0.93      | 0.90
        CASIA         | 99             | 52        | 29    | 8         | 0.36   | 0.35      | 0.36
        Scalewelis    | 99             | 70        | 32    | 1         | 0.33   | 0.33      | 0.33
        inteSearch    | 99             | 70        | 60    | 1         | 0.87   | 0.90      | 0.88
      Observation: pre-processed data statistics help in efficient sub-graph finding over the linked data graph
  • 37. Conclusion
      - IA over LD requires finding the proper sub-graph over the LD graph
      - We contributed an LD IA framework that
          - does not generate empty results
          - maintains contextual information attachment
          - retrieves rich information with low execution cost
      - The single-query-word-based Basic Graph can be extended to multiple query words, which could further increase efficiency
  • 39. Questions? Md-Mizanur Rahoman, mizan@nii.ac.jp; Ryutaro Ichise, ichise@nii.ac.jp