SlideShare a Scribd company logo
1 of 38
Optimized Index Structures
for Querying RDF from the
Web
Presented by : Mahdi Atawna
1
About the Paper
 The paper was published at the Third Latin
American Web Congress in 2005
 Have 56 citation.
Andreas Harth
National University
of Galway, Ireland
Prof. Stefan Decker
National University of
Galway, Ireland
2
Outline
 Overview of Semantic Web.
 Overview of Indexes.
 Paper motivation.
 Methodology.
 Experiment & Result.
 Conclusion.
3
Semantic Web
 Also called :
 Web 3.0.
 the Linked Data Web.
 the Web of Data…whatever you call it.
 the next major evolution in connecting information.
4
Why semantic web?
 It enables data to be linked from a source to any
other source.
 It can be understood by computers so that they can
perform increasingly sophisticated tasks on our
behalf.
5
6
Source: http://lod-cloud.net
Semantic Web Standards
 RDF (Resource Description Framework): The data modeling
language for the Semantic Web (like UML). All Semantic Web
information is stored and represented in the RDF.
 SPARQL : The query language of the Semantic Web.
 OWL (Web Ontology Language) The schema language, or
knowledge representation (KR) language, of the Semantic Web.
7
What is RDF?
 RDF is the data model of the Semantic Web.
 That means that all data in Semantic Web
technologies is represented as RDF.
 If you store Semantic Web data, it's in RDF.
 If you query Semantic Web data (typically using
SPARQL), it's RDF data. If you send Semantic Web
data to your friend, it's RDF.
8
9
10
Source : http://www.w3.org/TR/rdf11-primer/
RDF triples
 are representations of graph edges.
11
Subject Object
Predicate
Mahdi Hebron
born in
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/ .
<http://example.org/bob#me>
foaf:topic_interest
<http://wikidata.org/entity/Q12418> .
12
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/ .
<http://example.org/bob#me>
foaf:topic_interest
<http://wikidata.org/entity/Q12418> .
Subject
predicate
Object
13
Sparql query language
SELECT ?p ?o
{
<http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o
}
14
What is database index?
 A database index is a data structure that improves the
speed of data retrieval operations on a database table at
the cost of additional writes and storage space to maintain
the index data structure.
 Index goal : The index structure enables fast retrieval of
data
15
16
Index example
key articles
Leonardo [104,70,12,98]
Mona Lisa [2,201,7,20,12]
Francesco [1,8,900,104]
17
Paper motivation
 Previous Systems provide a storage infrastructure for RDF data, but
index structure which do not support typical query scenarios
for data from the Web which results in poor query answering
performance in some cases.
18
Methodology
 The researchers present a new index structure that handle the
data from the Web .
 Implemented the index structure in a lightweight software called
YARS
19
RDF Index structures
 The authors suggested an index structure that contains two
sets:
1. Lexicon : covers the string representation of RDF graph
(r,l,b)
2. Quad indexes : cover the quads (triples).
20
1. Lexicon indexes
 NodeOID and OIDNode Index
 Keyword Index
21
1. Lexicon indexes
 NodeOID and OIDNode Index :
22
Key value
<http://www.harth.org/andreas/#me> 3
<http://decker.cn/stefan/> 14
<http://sw.deri.org/ aharth/foaf.rdf> 11
<http://www.deri.org/> 1
1. Lexicon indexes
 Keyword Index (Popular in search engines)
23
Key No of hits List of hits
“Andreas” 1 3
“Decker” 1 11
”Harth” 1 3
“Stefan” 2 11,13
2. Quad indexes
 Access Patterns
 Combined Indexes
 Occurrence Counts
24
2. Quad indexes
A- Access Patterns
25
No Access pattern No Access pattern
1 (?:?:?:?) 9 (s:?:o:c)
2 (s:?:?:?) 10 (?:?:o:c)
3 (s:p:?:?) 11 (?:?:o:?)
4 (s:p:o:?) 12 (?:?:?:c)
5 (s:p:o:c) 13 (s:?:?:c)
6 (?:p:?:?) 14 (s:p:?:c)
7 (?:p:o:?) 15 (?:p:?:c)
8 (?:p:o:c) 16 (s:?:o:?)
2. Quad indexes
A- Access Patterns
26
No Access pattern values
1 (?:?:?:?) [1,5,3]
2 (s:?:?:?) [2]
3 (s:p:?:?) [9,8,2,3]
4 (s:p:o:?) [1,3]
5 (s:p:o:c) [1]
6 (?:p:?:?) [76,9]
7 (?:p:o:?) [2,3]
2. Quad indexes
B- Combined Indexes
27
spoc poc osc csp cp os
(?:?:?:?) (?:p:?:?) (?:?:o:?)
(?:?:?:c) (?:p:?:c) (s:?:o:?)
(s:?:?:?) (?:p:o:?) (?:?:o:c)
(s:?:?:c)
(s:p:?:?) (?:p:o:c) (s:?:o:c)
(s:p:?:c)
(s:p:o:?)
(s:p:o:c)
2. Quad indexes
C- Occurrence Counts
28
No
Access
pattern
values count
1 (?:?:?:?) [1,5,3] 3
2 (s:?:?:?) [2] 1
3 (s:p:?:?) [9,8,2,3] 4
4 (s:p:o:?) [1,3] 2
5 (s:p:o:c) [1] 1
6 (?:p:?:?) [76,9] 2
7 (?:p:o:?) [2,3] 2
YARS
 Web application that built in JAVA.
 Has two parts:
 a storage component that handles both persistent and in-
memory indexes.
 a query handler to perform query processing and evaluation.
29
Experiment
 They evaluated the performance based on a dataset of 2.8
million triples (293 MB).
 The testing server has :
 Pentium-4 2.4 GHz
 4 GB RAM
 running Debian Sarge .
30
Experiment
They considered the following RDF stores for evaluation:
 Sesame.
 Kowari (failed to get a running version).
 Redland.
 Jena2. ([9] shows that Sesame generally supersedes Jena in
performance results)
31
Experiment
Experiment parts:
 Index Construction.
 Queries.
32
Result – index construction
System Index size (bytes)
Redland 2.164.019.200
Sesame MySQL 340.381.636
Sesame native 39.997.992
YARS 1.090.002.944
33
Table 8: Index size for the synthetic Univ20 dataset.
Result – index construction
34
Result - queries
No Query
1 ?x rdf:type univ:UndergraduateStudent
2 ?x ?p ”UndergraduateStudent0”
3 <http://www.University965.edu> ?p ?o
4 ?x univ:worksFor ?y
35
Query Redland Sesame MySQL Sesame Native YARS
1 0:10.48 0:18.87 1:05.16 0:18.41
2 0:44.14 0:00.73 0:00.55 0:00.49
3 0:44.15 0:00.46 0:00.47 0:00.32
4 3:04.21 0:03.42 0:01.95 0:00.47
Performance results for quad queries.
Conclusion
 The auther introduced query processing for RDF which is an I
portant issue in sematic web.
 YARS has some overhead for resolving the dependencies and
order in comparison with others.
36
Criticism
 - In experiment , the researchers removed “Kowari “ engine
because the cannot install it.
37
Questions?
Thank you
38

More Related Content

What's hot

14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
R Programming Language
R Programming LanguageR Programming Language
R Programming LanguageNareshKarela1
 
R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-iDr. Awase Khirni Syed
 
Introduction to the language R
Introduction to the language RIntroduction to the language R
Introduction to the language Rfbenault
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationINRIA-OAK
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Olaf Hartig
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)Ankit Rathi
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming FundamentalsRagia Ibrahim
 
Positional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesPositional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesLeonidas Akritidis
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RYanchang Zhao
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformSyracuse University
 
R programming Language , Rahul Singh
R programming Language , Rahul SinghR programming Language , Rahul Singh
R programming Language , Rahul SinghRavi Basil
 

What's hot (20)

14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil
 
6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
Getting Started with R
Getting Started with RGetting Started with R
Getting Started with R
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil
 
R Programming Language
R Programming LanguageR Programming Language
R Programming Language
 
R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-i
 
Triple Stores
Triple StoresTriple Stores
Triple Stores
 
Introduction to the language R
Introduction to the language RIntroduction to the language R
Introduction to the language R
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimation
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)
 
Data Structure
Data StructureData Structure
Data Structure
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming Fundamentals
 
Positional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesPositional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted Indexes
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in R
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics Platform
 
R programming Language , Rahul Singh
R programming Language , Rahul SinghR programming Language , Rahul Singh
R programming Language , Rahul Singh
 

Similar to Optimized index structures for querying rdf from the web

Semantic Web and Related Work at W3C
Semantic Web and Related Work at W3CSemantic Web and Related Work at W3C
Semantic Web and Related Work at W3CIvan Herman
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Jose Emilio Labra Gayo
 
Modelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyModelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyAlbert Meroño-Peñuela
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
Strata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityStrata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityDatabricks
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic webWorawith Sangkatip
 
2021 04-20 apache arrow and its impact on the database industry.pptx
2021 04-20  apache arrow and its impact on the database industry.pptx2021 04-20  apache arrow and its impact on the database industry.pptx
2021 04-20 apache arrow and its impact on the database industry.pptxAndrew Lamb
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataStuart Chalk
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactJean-Paul Calbimonte
 
What's new in spark 2.0?
What's new in spark 2.0?What's new in spark 2.0?
What's new in spark 2.0?Örjan Lundberg
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهWeb Standards School
 
Nutch and lucene_framework
Nutch and lucene_frameworkNutch and lucene_framework
Nutch and lucene_frameworksamuelhard
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureMasud Rahman
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
Spark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkSpark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkDatabricks
 
RDF APIs for .NET Framework
RDF APIs for .NET FrameworkRDF APIs for .NET Framework
RDF APIs for .NET FrameworkAdriana Ivanciu
 

Similar to Optimized index structures for querying rdf from the web (20)

Semantic Web and Related Work at W3C
Semantic Web and Related Work at W3CSemantic Web and Related Work at W3C
Semantic Web and Related Work at W3C
 
Democratizing Big Semantic Data management
Democratizing Big Semantic Data managementDemocratizing Big Semantic Data management
Democratizing Big Semantic Data management
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
 
Modelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyModelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic Study
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Strata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityStrata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark community
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic web
 
2021 04-20 apache arrow and its impact on the database industry.pptx
2021 04-20  apache arrow and its impact on the database industry.pptx2021 04-20  apache arrow and its impact on the database industry.pptx
2021 04-20 apache arrow and its impact on the database industry.pptx
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility Data
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's React
 
What's new in spark 2.0?
What's new in spark 2.0?What's new in spark 2.0?
What's new in spark 2.0?
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
 
Nutch and lucene_framework
Nutch and lucene_frameworkNutch and lucene_framework
Nutch and lucene_framework
 
Metadata crosswalks
Metadata crosswalksMetadata crosswalks
Metadata crosswalks
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lecture
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Spark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkSpark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with Spark
 
RDF APIs for .NET Framework
RDF APIs for .NET FrameworkRDF APIs for .NET Framework
RDF APIs for .NET Framework
 

More from Mahdi Atawneh

Improving ip geolocation using query logs
Improving ip geolocation using query logsImproving ip geolocation using query logs
Improving ip geolocation using query logsMahdi Atawneh
 
Improvement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsImprovement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsMahdi Atawneh
 
SILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreSILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreMahdi Atawneh
 
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesOWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesMahdi Atawneh
 
Bat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxBat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxMahdi Atawneh
 
01 nosql and multi model database
01   nosql and multi model database01   nosql and multi model database
01 nosql and multi model databaseMahdi Atawneh
 

More from Mahdi Atawneh (6)

Improving ip geolocation using query logs
Improving ip geolocation using query logsImproving ip geolocation using query logs
Improving ip geolocation using query logs
 
Improvement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsImprovement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristics
 
SILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreSILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value Store
 
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesOWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
 
Bat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxBat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptx
 
01 nosql and multi model database
01   nosql and multi model database01   nosql and multi model database
01 nosql and multi model database
 

Recently uploaded

Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 

Recently uploaded (20)

Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 

Optimized index structures for querying rdf from the web

  • 1. Optimized Index Structures for Querying RDF from the Web Presented by : Mahdi Atawna 1
  • 2. About the Paper  The paper was published at the Third Latin American Web Congress in 2005  Have 56 citation. Andreas Harth National University of Galway, Ireland Prof. Stefan Decker National University of Galway, Ireland 2
  • 3. Outline  Overview of Semantic Web.  Overview of Indexes.  Paper motivation.  Methodology.  Experiment & Result.  Conclusion. 3
  • 4. Semantic Web  Also called :  Web 3.0.  the Linked Data Web.  the Web of Data…whatever you call it.  the next major evolution in connecting information. 4
  • 5. Why semantic web?  It enables data to be linked from a source to any other source.  It can be understood by computers so that they can perform increasingly sophisticated tasks on our behalf. 5
  • 7. Semantic Web Standards  RDF (Resource Description Framework): The data modeling language for the Semantic Web (like UML). All Semantic Web information is stored and represented in the RDF.  SPARQL : The query language of the Semantic Web.  OWL (Web Ontology Language) The schema language, or knowledge representation (KR) language, of the Semantic Web. 7
  • 8. What is RDF?  RDF is the data model of the Semantic Web.  That means that all data in Semantic Web technologies is represented as RDF.  If you store Semantic Web data, it's in RDF.  If you query Semantic Web data (typically using SPARQL), it's RDF data. If you send Semantic Web data to your friend, it's RDF. 8
  • 9. 9
  • 11. RDF triples  are representations of graph edges. 11 Subject Object Predicate Mahdi Hebron born in
  • 12. RDF example @prefix foaf: <http://xmlns.com/foaf/0.1/ . <http://example.org/bob#me> foaf:topic_interest <http://wikidata.org/entity/Q12418> . 12
  • 13. RDF example @prefix foaf: <http://xmlns.com/foaf/0.1/ . <http://example.org/bob#me> foaf:topic_interest <http://wikidata.org/entity/Q12418> . Subject predicate Object 13
  • 14. Sparql query language SELECT ?p ?o { <http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o } 14
  • 15. What is database index?  A database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space to maintain the index data structure.  Index goal : The index structure enables fast retrieval of data 15
  • 16. 16
  • 17. Index example key articles Leonardo [104,70,12,98] Mona Lisa [2,201,7,20,12] Francesco [1,8,900,104] 17
  • 18. Paper motivation  Previous Systems provide a storage infrastructure for RDF data, but index structure which do not support typical query scenarios for data from the Web which results in poor query answering performance in some cases. 18
  • 19. Methodology  The researchers present a new index structure that handle the data from the Web .  Implemented the index structure in a lightweight software called YARS 19
  • 20. RDF Index structures  The authors suggested an index structure that contains two sets: 1. Lexicon : covers the string representation of RDF graph (r,l,b) 2. Quad indexes : cover the quads (triples). 20
  • 21. 1. Lexicon indexes  NodeOID and OIDNode Index  Keyword Index 21
  • 22. 1. Lexicon indexes  NodeOID and OIDNode Index : 22 Key value <http://www.harth.org/andreas/#me> 3 <http://decker.cn/stefan/> 14 <http://sw.deri.org/ aharth/foaf.rdf> 11 <http://www.deri.org/> 1
  • 23. 1. Lexicon indexes  Keyword Index (Popular in search engines) 23 Key No of hits List of hits “Andreas” 1 3 “Decker” 1 11 ”Harth” 1 3 “Stefan” 2 11,13
  • 24. 2. Quad indexes  Access Patterns  Combined Indexes  Occurrence Counts 24
  • 25. 2. Quad indexes A- Access Patterns 25 No Access pattern No Access pattern 1 (?:?:?:?) 9 (s:?:o:c) 2 (s:?:?:?) 10 (?:?:o:c) 3 (s:p:?:?) 11 (?:?:o:?) 4 (s:p:o:?) 12 (?:?:?:c) 5 (s:p:o:c) 13 (s:?:?:c) 6 (?:p:?:?) 14 (s:p:?:c) 7 (?:p:o:?) 15 (?:p:?:c) 8 (?:p:o:c) 16 (s:?:o:?)
  • 26. 2. Quad indexes A- Access Patterns 26 No Access pattern values 1 (?:?:?:?) [1,5,3] 2 (s:?:?:?) [2] 3 (s:p:?:?) [9,8,2,3] 4 (s:p:o:?) [1,3] 5 (s:p:o:c) [1] 6 (?:p:?:?) [76,9] 7 (?:p:o:?) [2,3]
  • 27. 2. Quad indexes B- Combined Indexes 27 spoc poc osc csp cp os (?:?:?:?) (?:p:?:?) (?:?:o:?) (?:?:?:c) (?:p:?:c) (s:?:o:?) (s:?:?:?) (?:p:o:?) (?:?:o:c) (s:?:?:c) (s:p:?:?) (?:p:o:c) (s:?:o:c) (s:p:?:c) (s:p:o:?) (s:p:o:c)
  • 28. 2. Quad indexes C- Occurrence Counts 28 No Access pattern values count 1 (?:?:?:?) [1,5,3] 3 2 (s:?:?:?) [2] 1 3 (s:p:?:?) [9,8,2,3] 4 4 (s:p:o:?) [1,3] 2 5 (s:p:o:c) [1] 1 6 (?:p:?:?) [76,9] 2 7 (?:p:o:?) [2,3] 2
  • 29. YARS  Web application that built in JAVA.  Has two parts:  a storage component that handles both persistent and in- memory indexes.  a query handler to perform query processing and evaluation. 29
  • 30. Experiment  They evaluated the performance based on a dataset of 2.8 million triples (293 MB).  The testing server has :  Pentium-4 2.4 GHz  4 GB RAM  running Debian Sarge . 30
  • 31. Experiment They considered the following RDF stores for evaluation:  Sesame.  Kowari (failed to get a running version).  Redland.  Jena2. ([9] shows that Sesame generally supersedes Jena in performance results) 31
  • 32. Experiment Experiment parts:  Index Construction.  Queries. 32
  • 33. Result – index construction System Index size (bytes) Redland 2.164.019.200 Sesame MySQL 340.381.636 Sesame native 39.997.992 YARS 1.090.002.944 33 Table 8: Index size for the synthetic Univ20 dataset.
  • 34. Result – index construction 34
  • 35. Result - queries No Query 1 ?x rdf:type univ:UndergraduateStudent 2 ?x ?p ”UndergraduateStudent0” 3 <http://www.University965.edu> ?p ?o 4 ?x univ:worksFor ?y 35 Query Redland Sesame MySQL Sesame Native YARS 1 0:10.48 0:18.87 1:05.16 0:18.41 2 0:44.14 0:00.73 0:00.55 0:00.49 3 0:44.15 0:00.46 0:00.47 0:00.32 4 3:04.21 0:03.42 0:01.95 0:00.47 Performance results for quad queries.
  • 36. Conclusion  The auther introduced query processing for RDF which is an I portant issue in sematic web.  YARS has some overhead for resolving the dependencies and order in comparison with others. 36
  • 37. Criticism  - In experiment , the researchers removed “Kowari “ engine because the cannot install it. 37