Within the course, we will present Linked Data as a set of best practices for publishing and connecting structured data on the Web. These best practices have been adopted by an increasing number of data providers over the past years, leading to the creation of a global data space that contains many billions of assertions – the Web of Linked Data.
Linked Open Data Principles, Technologies and ExamplesOpen Data Support
Theoretical and practical introducton to linked data, focusing both on the value proposition, the theory/foundations, and on practical examples. The material is tailored to the context of the EU institutions.
This module supported the training on Linked Open Data delivered to the EU Institutions on 30 November 2015 in Brussels. https://joinup.ec.europa.eu/community/ods/news/ods-onsite-training-european-commission
Linked Open Data Principles, Technologies and ExamplesOpen Data Support
Theoretical and practical introducton to linked data, focusing both on the value proposition, the theory/foundations, and on practical examples. The material is tailored to the context of the EU institutions.
This module supported the training on Linked Open Data delivered to the EU Institutions on 30 November 2015 in Brussels. https://joinup.ec.europa.eu/community/ods/news/ods-onsite-training-european-commission
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
Over the past 4 years, the Semantic Web activity has gained momentum with the widespread publishing of structured data as RDF. The Linked Data paradigm has therefore evolved from a practical research idea into
a very promising candidate for addressing one of the biggest challenges
of computer science: the exploitation of the Web as a platform for data
and information integration. To translate this initial success into a
world-scale reality, a number of research challenges need to be
addressed: the performance gap between relational and RDF data
management has to be closed, coherence and quality of data published on
the Web have to be improved, provenance and trust on the Linked Data Web
must be established and generally the entrance barrier for data
publishers and users has to be lowered. This tutorial will discuss
approaches for tackling these challenges. As an example of a successful
Linked Data project we will present DBpedia, which leverages Wikipedia
by extracting structured information and by making this information
freely accessible on the Web. The tutorial will also outline some recent advances in DBpedia, such as the mappings Wiki, DBpedia Live as well as
the recently launched DBpedia benchmark.
An introduction to linked data (semantic web) for a Knowledge and Information Network (KIN) webinar. The presentation shows some examples of linked data in action, data visualization, difference between open and linked data and how linkd data is being used in UK gov and local gov.
Big Linked Data - Creating Training CurriculaEUCLID project
This presentation includes an overview of the basic rules to follow when developing training and education curricula for Linked Data and Big Linked Data
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
This presentation introduces the main principles of Linked Data, the underlying technologies and background standards. It provides basic knowledge for how data can be published over the Web, how it can be queried, and what are the possible use cases and benefits. As an example, we use the development of a music portal (based on the MusicBrainz dataset), which facilitates access to a wide range of information and multimedia resources relating to music.
PoolParty Semantic Suite: Management Briefing and Functional Overview Martin Kaltenböck
Slides for the presentation of PoolParty Semantic Suite on 12.11. 2015 at KNVI Congres 2015 in Utrecht, the Netherlands, see: http://congres.knvi.info/ by Martin Kaltenböck in the Big Data & Linked Data Session.
Linked data for Enterprise Data IntegrationSören Auer
The Web evolves into a Web of Data. In parallel Intranets of large companies will evolve into Data Intranets based on the Linked Data principles. Linked Data has the potential to complement the SOA paradigm with a light-weight, adaptive data integration approach.
Slides from our tutorial on Linked Data generation in the energy domain, presented at the Sustainable Places 2014 conference on October 2nd in Nice, France
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
Over the past 4 years, the Semantic Web activity has gained momentum with the widespread publishing of structured data as RDF. The Linked Data paradigm has therefore evolved from a practical research idea into
a very promising candidate for addressing one of the biggest challenges
of computer science: the exploitation of the Web as a platform for data
and information integration. To translate this initial success into a
world-scale reality, a number of research challenges need to be
addressed: the performance gap between relational and RDF data
management has to be closed, coherence and quality of data published on
the Web have to be improved, provenance and trust on the Linked Data Web
must be established and generally the entrance barrier for data
publishers and users has to be lowered. This tutorial will discuss
approaches for tackling these challenges. As an example of a successful
Linked Data project we will present DBpedia, which leverages Wikipedia
by extracting structured information and by making this information
freely accessible on the Web. The tutorial will also outline some recent advances in DBpedia, such as the mappings Wiki, DBpedia Live as well as
the recently launched DBpedia benchmark.
An introduction to linked data (semantic web) for a Knowledge and Information Network (KIN) webinar. The presentation shows some examples of linked data in action, data visualization, difference between open and linked data and how linkd data is being used in UK gov and local gov.
Big Linked Data - Creating Training CurriculaEUCLID project
This presentation includes an overview of the basic rules to follow when developing training and education curricula for Linked Data and Big Linked Data
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
This presentation introduces the main principles of Linked Data, the underlying technologies and background standards. It provides basic knowledge for how data can be published over the Web, how it can be queried, and what are the possible use cases and benefits. As an example, we use the development of a music portal (based on the MusicBrainz dataset), which facilitates access to a wide range of information and multimedia resources relating to music.
PoolParty Semantic Suite: Management Briefing and Functional Overview Martin Kaltenböck
Slides for the presentation of PoolParty Semantic Suite on 12.11. 2015 at KNVI Congres 2015 in Utrecht, the Netherlands, see: http://congres.knvi.info/ by Martin Kaltenböck in the Big Data & Linked Data Session.
Linked data for Enterprise Data IntegrationSören Auer
The Web evolves into a Web of Data. In parallel Intranets of large companies will evolve into Data Intranets based on the Linked Data principles. Linked Data has the potential to complement the SOA paradigm with a light-weight, adaptive data integration approach.
Slides from our tutorial on Linked Data generation in the energy domain, presented at the Sustainable Places 2014 conference on October 2nd in Nice, France
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...JohannWanja
The choice of which vocabulary to reuse when modeling and publishing Linked Open Data (LOD) is far from trivial. There is no study that investigates the different strategies of reusing vocabularies for LOD modeling and publishing. In this paper, we present the results of a survey with 79 participants that examines the most preferred vocabulary reuse strategies of LOD modeling. The participants, LOD publishers and practitioners, were asked to assess different vocabulary reuse strategies and explain their ranking decision. We found significant differences between the modeling strategies that range from reusing popular vocabularies, minimizing the number of vocabularies, and staying within one domain vocabulary. A very interesting insight is that the popularity in the meaning of how frequent a vocabulary is used in a data source is more important than how often individual classes and properties are used in the LOD cloud. Overall, the results of this survey help in better understanding the strategies how data engineers reuse vocabularies and may also be used to develop future vocabulary engineering tools.
A few metrics about Open Data in the cultural sectorJoris Pekel
Presentation at the Open Knowledge Conference in Geneva. Here I talked about the importance of good quality metadata and open licenses in order to get institutions data to be found, and included some metrics.
Publishing Linked Open Data in 15 minutesAlvaro Graves
In this presentation I will show why Linked Open Data is the best technique available to publish government data and how can you use LODSPeaKr, a simple kit for publishing Linked Data, to create from prototypes in minutes to Open Data Portals, APIs and mobile webapps.
What is pattern recognition (lecture 4 of 6)Randa Elanwar
In this series I intend to simplify a beautiful branch of computer science that we as humans use it in everyday life without knowing. Pattern recognition is a sub-branch of the computer vision research and is tightly related to digital signal processing research as well as machine learning and artificial intelligence.
Presentation delivered by Ludo Hendrickx and Joris Beek on 11 December 2013 Dutch at the Ministry of Interior, The Hague, The Netherlands. More information on: https://joinup.ec.europa.eu/community/ods/description
In this project, we study the classification problem and compare some traditional statistical models with neural networks. This work was done in the frame of postgraduate programme in Web Science at Department of Mathematics, Aristotle University of Thessaloniki
Web of Data and its Status on Persian Web Data SpaceAli Khalili
Linked Data as a step to realization of Semantic Web vision consists of a set of best practices for publishing and connecting structured data on the Web. These best practices have been adopted by an increasing number of data providers over the past years, leading to the creation of a global data space that contains many billions of assertions – the Web of Linked Data. Recently, Linked Data has received attention in domains such as libraries, e-government, e-commerce, search, news providers, e-learning and other data integration applications. Nonetheless, looking at the current status of Linked Data on Persian Web space, adoption has been very limited and data is trapped in many Persian Websites without allowing the integration and reasoning. One reason for this, is the lack of infrastructure and NLP tools to deal with the specific requirements of Persian data. This talk will introduce the Linked (Open) Data principles, its lifecycle and the required steps towards its realization on Persian Web data space.
Linked Data is one of the core concepts and pillars of the Semantic Web. The Semantic Web is all about making links between islands of data in a way that is understandable not only to humans but also to machines. Due to the increasing amount of Linked Data openly published on the Web, user-facing Linked Data Applications (LDAs) are gaining momentum. One of the major entrance barriers for end-users and Web developers to benefit from this wave of LDAs is the required knowledge of Semantic Web technologies such as the RDF data model and SPARQL query language. In this talk, I will discuss the emerging field of Human-Linked Data Interaction to identify and address existing issues related to interacting with Linked Data scattered over multiple knowledge graphs. In particular, I will present the WYSIWYM (What-You-See-Is-What You Mean) concept as a binding between semantic representation models and user interface (UI) elements for authoring, visualizing and exploring Linked Data. As an example implementation of the WYSIWYM model, I will showcase the Linked Data Reactor (http://ld-r.org) software framework for building adaptive and flexible Linked Data UIs. In order to mention some of the use cases of adaptive Linked Data applications, I will present the SMS (Semantically Mapping Science) platform which helps social scientists to find answers to their research questions related to the field of science, technology and innovation studies.
In this webinar, we look at how you obtain and use open data, the key role of search engines and how you establish rust in the data you find. The webinar will also look at the quality of data and how to clean and prepare data for analysis. Finally, the session will look at how you can quickly visualise cleaned data and the applications of this in the agriculture sector.
Lecture at the advanced course on Data Science of the SIKS research school, May 20, 2016, Vught, The Netherlands.
Contents
-Why do we create Linked Open Data? Example questions from the Humanities and Social Sciences
-Introduction into Linked Open Data
-Lessons learned about the creation of Linked Open Data (link discovery, knowledge representation, evaluation).
-Accessing Linked Open Data
Resources and Lessons on Open Data from the World Banktariqkhokhar
Delivered by Tariq Khokhar at the European Association of Development Research and Training Institute's Information Management Working Group Conference in Antwerp, Belgium on September 13th, 2012.
A short introduction to the world of open data and the opportunities it creates. The slides are from my presentation at the GoOpen 2009 conference in Oslo, Norway.
This presentation was given at the WDCC Meetup Practical Applications of Linked Data on September 26th at the Wageningen University & Research (WUR).
The intention of this presentation was to give the audience an idea of how Linked Data works and what the role of Linked Data can be for better cross border and cross disciplinary research and more open and better connected research data when you want e.g. to build an international open research data infrastructure like EOSC using a GO FAIR approach.
#LinkedData #OpenAcess #OpenScience #OpenResearchData #interoperabilty #connectivity #DataSharing #SmartCollaboration #NoUnnecessaryDataCopies #RDF #triples #URIs #PIDs #taxonomies #thesauri #ontologies #vocabularies #SKOS #RDFS #OWL #SHACL #SPARQL #OpenAPIs #REST #KnowledgeGraphs #DataClouds #CrossBorder #CrossDomain #CrossDisciplinary #FAIR #GOFAIR #FAIRification #FAIRifier #FDPs #FAIRDataPoints #IFDS #InternetOfFAIRDataAndServices #EOSC #EuropeanOpenScienceCloud #Solid #PODS #DataOwnership #GDPR #AVG #CompatibleDataShapes #MetadataShapes
LinuxCon 2010 Education Mini-Summit: The State of Open Data in Educationcomputercolin
A call for more, open data in education so we can foster innovative applications, better tools for teaching, and tons of interesting applications we haven't even thought of.
Similar to An introduction to Linked (Open) Data (20)
FERASAT: A Serendipity-Fostering Faceted Browser for Linked DataAli Khalili
Accidental knowledge discoveries occur most frequently during capricious and unplanned search and browsing of data. This type of undirected, random, and exploratory search and browsing of data results in Serendipity – the art of unsought finding. In our previous work we extracted a set of serendipity-fostering design features for developing intelligent user interfaces on Semantic Web and Linked Data browsing environments. The features facilitate the discovery of interesting and valuable facts in (linked) data which were not initially sought for. In this work, we present an implementation of those features called FERASAT. FERASAT provides an adaptive multigraph-based faceted browsing interface to catalyze serendipity while browsing Linked Data. FERASAT is already in use within the domain of science, technology & innovation (STI) studies to allow researchers who are not familiar with Linked Data technologies to explore heterogeneous interlinked datasets in order to observe and interpret surprising facts from the data relevant to policy and innovation studies. In addition to an analysis of the related work, we describe two STI use cases in the paper and demonstrate how different serendipity design features are addressed in those use cases.
Adaptive Linked Data-driven Web Components: Building Flexible and Reusable Se...Ali Khalili
Due to the increasing amount of Linked Data openly published on the Web, user-facing Linked Data Applications (LDAs) are gaining momentum. One of the major entrance barriers for Web developers to contribute to this wave of LDAs is the required knowledge of Semantic Web technologies such as the RDF data model and SPARQL query language. This paper presents an adaptive component-based approach together with its open source implementation for creating flexible and reusable Semantic Web interfaces driven by Linked Data. Linked Data-driven (LD-R) Web components abstract the complexity of the underlying Semantic Web technologies in order to allow reuse of existing Web components in LDAs, enabling Web developers who are not experts in Semantic Web to develop interfaces that view, edit and browse Linked Data. In addition to the modularity provided by the LD-R components, the proposed RDF-based configuration method allows application assemblers to reshape their user interface for different use cases, by either reusing existing shared configurations or by creating their proprietary configurations.
A Semantics-based User Interface Model for Content Annotation, Authoring and ...Ali Khalili
A Semantics-based User Interface Model for Content Annotation, Authoring and Exploration: My PhD defense slides
full version of thesis:
http://svn.aksw.org/papers/2014/Thesis_Ali/public.pdf
conTEXT -- Lightweight Text Analytics using Linked DataAli Khalili
The Web democratized publishing -- everybody can easily publish information on a Website, Blog, in social networks or microblogging systems. The more the amount of published information grows, the more important are technologies for accessing, analysing, summarising and visualising information. While substantial progress has been made in the last years in each of these areas individually, we argue, that only the intelligent combination of approaches will make this progress truly useful and leverage further synergies between techniques. In this paper we develop a text analytics architecture of participation, which allows ordinary people to use sophisticated NLP techniques for analysing and visualizing their content, be it a Blog, Twitter feed, Website or article collection. The architecture comprises interfaces for information access, natural language processing and visualization. Dierent exchangeable components can be plugged into this architecture, making it easy to tailor for individual needs. We evaluate the usefulness of our approach by comparing both the eectiveness and eciency of end users within a task-solving setting. Moreover, we evaluate the usability of our approach using a questionnaire-driven approach. Both evaluations suggest that oridinary Web users are empowered to analyse their data and perform tasks, which were previously out of reach.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects to see demand and the changing evolution of supply, facilitated through institutional investment rotation out of offices and into work from home (“WFH”), while the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
The affect of service quality and online reviews on customer loyalty in the E...
An introduction to Linked (Open) Data
1. An Introduction to
Linked (Open) Data
Ali Khalili, PhD
Knowledge Representation & Reasoning Group
VU University Amsterdam
2. Why do we need the Data Web?
Linked Open Data 2Ali Khalili
Linked Open Data
3. Why do we need the Data Web?
Linked Open Data 2Ali Khalili
Linked Open Data
Problem: Try to search for these things on the current Web:
• Apartments near Dutch-English bilingual childcare in Amsterdam
4. Why do we need the Data Web?
Linked Open Data 2Ali Khalili
Linked Open Data
Problem: Try to search for these things on the current Web:
• Apartments near Dutch-English bilingual childcare in Amsterdam
Web
Server
funda.nl kindergarden.nl
Has data about childcare
in Amsterdam.
Knows about real
estate offers in the
Netherlands.
DBDB
HTMLHTML
Web
Server
5. Why do we need the Data Web?
Linked Open Data 2Ali Khalili
Linked Open Data
Problem: Try to search for these things on the current Web:
• Apartments near Dutch-English bilingual childcare in Amsterdam
Web
Server
funda.nl kindergarden.nl
Has data about childcare
in Amsterdam.
Knows about real
estate offers in the
Netherlands.
DBDB
HTMLHTML
Web
Server
RDF
6. Why do we need the Data Web? (con.)
Linked Open Data 3Ali Khalili
Linked Open Data
• Researchers working on Semantic Web topics in the Middle East.
• Side effects of some drugs with a specific chemical compound
prescribed for certain diseases.
• Who are mayors of central European towns elevated more than 1000m?
• Which movies are starring both Brad Pitt and Angelina Jolie?
• All soccer players, who played as goalkeeper for a club that has a
stadium with more than 40.000 seats and who are born in a country with
more than 10 million inhabitants
• …
7. Why do we need the Data Web? (con.)
Linked Open Data 3Ali Khalili
Linked Open Data
• Researchers working on Semantic Web topics in the Middle East.
• Side effects of some drugs with a specific chemical compound
prescribed for certain diseases.
• Who are mayors of central European towns elevated more than 1000m?
• Which movies are starring both Brad Pitt and Angelina Jolie?
• All soccer players, who played as goalkeeper for a club that has a
stadium with more than 40.000 seats and who are born in a country with
more than 10 million inhabitants
• …
Information is available on the Web,
but opaque to current search.
8. Evolution of the Web
Linked Open Data 4Ali Khalili
Linked Open Data
https://mcgratha.wordpress.com/
9. Evolution of the Web
Linked Open Data 4Ali Khalili
Linked Open Data
https://mcgratha.wordpress.com/
1998: Semantic Web Road map
10. Evolution of the Web
Linked Open Data 4Ali Khalili
Linked Open Data
https://mcgratha.wordpress.com/
1998: Semantic Web Road map
2006: emergence of Data Web
11. Linked Data
Linked Open Data 5Ali Khalili
Linked Open Data
• A set of best practices for publishing data on the Web.
12. Linked Data
Linked Open Data 5Ali Khalili
Linked Open Data
• A set of best practices for publishing data on the Web.
• Follows 4 simple principles:
1. Use URIs as names (identifiers) for conceptual things.
2. Use HTTP URIs so that users can look up (dereference) those
names.
3. When someone looks up a URI, provide useful information,
using the open standards such as RDF, SPARQL, etc.
4. Include links to other URIs, so that users can discover more
things.
18. 5 Open Data
Linked Open Data 7Ali Khalili
Linked Open Data
make your stuff available on the Web (whatever format)
under an open license
make it available as structured data
(e.g., Excel instead of image scan of a table)
make it available in a non-proprietary open format
(e.g., CSV as well as of Excel)
use Linked Data format
(URIs to identify things, RDF to represent data)
link your data to other people’s data to provide context
http://5stardata.info/
31. Linked Open Data 12Ali Khalili
Linked Open Data
Ranking of universities Amount of funding acquired by universities?
Hands-on
32. Linked Open Data 12Ali Khalili
Linked Open Data
Ranking of universities Amount of funding acquired by universities?
Hands-on
Make a sorted list of all universities in the Netherlands
displaying their recent rankings (e.g. CWTS Leiden Ranking or
Times World University Rankings) together with the amount of
EC contributions on H2020 projects?
33. Linked Open Data 13Ali Khalili
Linked Open Data
Interlinking
Enrichment
Quality
Analysis
Evolution
Exploration
Extraction
Storage/
Querying
Authoring
Linked (Open) Data
Lifecycle
http://stack.linkeddata.org/
34. Linked Open Data 14Ali Khalili
Linked Open Data Lifecycle
Exploration
35. Linked Open Data 14Ali Khalili
Linked Open Data Lifecycle
• Search
• Browse
• Visualize
Exploration
36. Search for Linked Data
Linked Open Data 15Ali Khalili
Linked Open Data Lifecycle Exploration
37. Search for Linked Data
Linked Open Data 15Ali Khalili
Linked Open Data Lifecycle Exploration
http://swoogle.umbc.edu/
38. Search for Linked Data
Linked Open Data 16Ali Khalili
Linked Open Data Lifecycle Exploration
39. Search in Linked Data
Linked Open Data 17Ali Khalili
Linked Open Data Lifecycle Exploration
http://www.wolframalpha.com/
Question Answering Systems
40. Search for Linked Data
Linked Open Data 18Ali Khalili
Linked Open Data Lifecycle Exploration
http://lov.okfn.org/
41. Search for Linked Data
Linked Open Data 19Ali Khalili
Linked Open Data Lifecycle Exploration
http://lotus.lodlaundromat.org
42. Search for Linked Data
Linked Open Data 20Ali Khalili
Linked Open Data Lifecycle Exploration
http://demo.ld-r.org/lotus
43. Search for Linked Data
Linked Open Data 21Ali Khalili
Linked Open Data Lifecycle
Data hub http://datahub.io
search for data, register published datasets, create and manage groups of datasets…
Exploration
44. Search for/in Linked Data
Linked Open Data 22Ali Khalili
Linked Open Data Lifecycle Exploration
45. Search for/in Linked Data
Linked Open Data 22Ali Khalili
Linked Open Data Lifecycle Exploration
46. Search for/in Linked Data
Linked Open Data 23Ali Khalili
Linked Open Data Lifecycle Exploration
https://www.google.com/cse/
47. Search for/in Linked Data
Linked Open Data 24Ali Khalili
Linked Open Data Lifecycle Exploration
http://schema.org/
http://bl.ocks.org/danbri/1c121ea8bd2189cf411c
48. Browsing Linked Data
Linked Open Data 25Ali Khalili
Linked Open Data Lifecycle
Loupe http://loupe.linkeddata.es/
discovering the type of data contained in a dataset, its structure, and the vocabularies used…
Exploration
49. Browsing Linked Data
Linked Open Data 26Ali Khalili
Linked Open Data Lifecycle
Linked Data Reactor http://ld-r.org
a component-based framework to view, browse and edit Linked Data
Exploration
50. Browsing Linked Data
Linked Open Data 27Ali Khalili
Linked Open Data Lifecycle
Exhibit http://www.simile-widgets.org/exhibit/
Exploration
51. Visualizing Linked Data
Linked Open Data 28Ali Khalili
Linked Open Data Lifecycle Exploration
Using existing visualisations for structured data
e.g. Sgvizler http://dev.data2000.no/sgvizler/
52. Visualizing Linked Data
Linked Open Data 29Ali Khalili
Linked Open Data Lifecycle Exploration
Using graph-based visualizations
e.g. RelFinder http://www.visualdataweb.org/relfinder/relfinder.php
53. Hands-on
Linked Open Data 30Ali Khalili
Linked Open Data Lifecycle Exploration
1. List of all universities in the Netherlands
2. University Rankings (e.g. CWTS Leiden Ranking or Times
World University Rankings)
3. H2020 EU projects repository
Explore Web of Data for the following data sources:
55. from Unstructured sources
Linked Open Data 32Ali Khalili
Linked Open Data Lifecycle Extraction
…After leaving Apple, Jobs took a few of its members with him to
found NeXT, a computer platform development company based in
Redwood City, specializing in state-of-the-art computers for higher-
education and business markets. In addition, Jobs helped to initiate
the development of the visual effects industry when he funded the
spinout of the computer graphics division of George Lucas's
company Lucasfilm in 1986. The new company, Pixar, would
eventually produce the first fully computer-animated film, Toy Story…
NLP, Text mining, Annotation
56. from Unstructured sources
Linked Open Data 32Ali Khalili
Linked Open Data Lifecycle Extraction
…After leaving Apple, Jobs took a few of its members with him to
found NeXT, a computer platform development company based in
Redwood City, specializing in state-of-the-art computers for higher-
education and business markets. In addition, Jobs helped to initiate
the development of the visual effects industry when he funded the
spinout of the computer graphics division of George Lucas's
company Lucasfilm in 1986. The new company, Pixar, would
eventually produce the first fully computer-animated film, Toy Story…
NLP, Text mining, Annotation
Named Entity Recognition
57. from Unstructured sources
Linked Open Data 32Ali Khalili
Linked Open Data Lifecycle Extraction
…After leaving Apple, Jobs took a few of its members with him to
found NeXT, a computer platform development company based in
Redwood City, specializing in state-of-the-art computers for higher-
education and business markets. In addition, Jobs helped to initiate
the development of the visual effects industry when he funded the
spinout of the computer graphics division of George Lucas's
company Lucasfilm in 1986. The new company, Pixar, would
eventually produce the first fully computer-animated film, Toy Story…
NLP, Text mining, Annotation
Named Entity Recognition
foundedBy
Relation Extraction
58. Named Entity Recognition
Linked Open Data 33Ali Khalili
Linked Open Data Lifecycle Extraction
http://spotlight.dbpedia.org
http://bioportal.bioontology.org/annotator
59. Visualizing Linked Data
Linked Open Data 34Ali Khalili
Linked Open Data Lifecycle Exploration
Contextual visualizations
e.g. Zemanta https://wordpress.org/plugins/zemanta
60. Linked Open Data 35Ali Khalili
Linked Open Data Lifecycle Extraction
http://nlp2rdf.org
Interoperability between NLP tools, language
resources and annotations
NLP Interchange Format (NIF)
Named Entity Recognition
& Disambiguation (NERD)
http://nerd.eurecom.fr/
63. Multi-lingual
• 38.3m things in 125 languages
• 23.8m are localized descriptions of things in the English DBpedia
• English DBpedia describes 4.58m things (68,091,260 statements)
Multi-domain
1,445,000 persons, 735,000 places (including 478,000 populated places), 411,000 creative works
(including 123,000 music albums, 87,000 films and 19,000 video games), 241,000 organizations
(including 58,000 companies and 49,000 educational institutions), 251,000 species and 6,000 diseases
Linked Open Data 37Ali Khalili
Linked Open Data Lifecycle Extraction DBpedia
64. Multi-lingual
• 38.3m things in 125 languages
• 23.8m are localized descriptions of things in the English DBpedia
• English DBpedia describes 4.58m things (68,091,260 statements)
Multi-domain
1,445,000 persons, 735,000 places (including 478,000 populated places), 411,000 creative works
(including 123,000 music albums, 87,000 films and 19,000 video games), 241,000 organizations
(including 58,000 companies and 49,000 educational institutions), 251,000 species and 6,000 diseases
Linked Open Data 37Ali Khalili
Linked Open Data Lifecycle Extraction DBpedia
65. Linked Open Data 38Ali Khalili
Linked Open Data Lifecycle Extraction
• Ad-hoc
• DBpedia extraction framework
• Generic
• OpenRefine
from Semi-structured sources
67. from Structured sources
Linked Open Data 39Ali Khalili
Linked Open Data Lifecycle Extraction
Triplification by Materialization
68. from Structured sources
Linked Open Data 40Ali Khalili
Linked Open Data Lifecycle Extraction
Triplification by SPARQL-to-SQL-Rewriting
69. Triplification
Linked Open Data 41Ali Khalili
Linked Open Data Lifecycle Extraction
• Relational Database to RDF
R2RML: RDB to RDF Mapping Language
http://www.w3.org/TR/r2rml/
• D2R Server: Accessing databases with SPARQL &
as Linked Data
http://d2rq.org/
• Sparqlify
defining RDF views on relational databases
http://sparqlify.org/
70. Hands-on
Linked Open Data 42Ali Khalili
Linked Open Data Lifecycle Extraction
1. University Rankings (e.g. CWTS Leiden Ranking or Times
World University Rankings)
2. H2020 EU projects repository
Triplify the following data sources:
72. Relational Databases vs. Triple Stores
Linked Open Data 44Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• A relational databases’ (e.g. MySQL, PostgreSQL, Oracle)
natural representation is a collection interlinked tables.
• A triple stores’ (e.g. OpenSesame, AllegroGraph, Neo4j)
natural representation is a multi-relational network, or graph.
73. Relational Databases vs. Triple Stores
Linked Open Data 45Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• Relational databases tend to not maintain public
access points (some might provide Web APIs).
• Triple stores maintain public access points called
SPARQL end-points.
• Relational database users tend to not publish their
schemas.
• Triple store users tend to reuse and extend public
schemas called ontologies.
74. Existing Triple Stores
Linked Open Data 46Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• Native triple stores
4Store, AllegroGraph, BigData, Jena TDB, Sesame,
Stardog, OWLIM and uRiKa
• RDBMS-backed triple stores
Jena SDB, IBM DB2 and OpenLink Virtuoso
• NoSQL triplestores
CumulusRDF
75. SPARQL – SQL for the Linked Data
Linked Open Data 47Ali Khalili
Linked Open Data Lifecycle Storage/Querying
What can be done with SPARQL that can't be done with SQL?
76. SPARQL – SQL for the Linked Data
Linked Open Data 47Ali Khalili
Linked Open Data Lifecycle Storage/Querying
What can be done with SPARQL that can't be done with SQL?
• SPARQL queries are considerably better aligned with users’ mental
models of a domain.
77. SPARQL – SQL for the Linked Data
Linked Open Data 47Ali Khalili
Linked Open Data Lifecycle Storage/Querying
What can be done with SPARQL that can't be done with SQL?
• SPARQL queries are considerably better aligned with users’ mental
models of a domain.
78. SPARQL – SQL for the Linked Data
Linked Open Data 48Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• SPARQL allows the conceptual data model to be fully explored
through queries.
79. SPARQL – SQL for the Linked Data
Linked Open Data 48Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• SPARQL allows the conceptual data model to be fully explored
through queries.
- example:workPhone rdfs:subPropertyOf example:phone
- example:cellPhone rdfs:subPropertyOf example:phone
- example:homePhone rdfs:subPropertyOf example:phone
80. SPARQL – SQL for the Linked Data
Linked Open Data 49Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• Queries that have to traverse a chain of connections are
particularly complex in SQL while very simple in SPARQL.
81. SPARQL – SQL for the Linked Data
Linked Open Data 49Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• Queries that have to traverse a chain of connections are
particularly complex in SQL while very simple in SPARQL.
82. SPARQL – SQL for the Linked Data
Linked Open Data 50Ali Khalili
Linked Open Data Lifecycle Storage/Querying
• In addition to SELECT, INSERT and DELETE, SPARQL supports
ASK queries.
• SPARQL includes syntax (i.e. SERVICE) to call two or more data
sources within a single query.
• …
83. SPARQL Query Interface
Linked Open Data 51Ali Khalili
Linked Open Data Lifecycle Storage/Querying
http://yasgui.org/
84. Hands-on
Linked Open Data 52Ali Khalili
Linked Open Data Lifecycle Storage/Querying
Write a SPARQL query to return a list of all
universities in the Netherlands sorted by name.
DBpedia Endpoints:
http://dbpedia.org/sparql
http://live.dbpedia.org/sparql
http://dbpedia-live.openlinksw.com/sparql
http://lod.openlinksw.com/sparql
86. Semantic Wikis
Linked Open Data 54Ali Khalili
Linked Open Data Lifecycle Authoring
• Semantic (Text) Wikis
Authoring of semantically annotated text
• Semantic Data Wikis
Direct authoring of structured information
(i.e. RDF, RDF-Schema, OWL)
http://semantic-mediawiki.org/
http://aksw.org/Projects/OntoWiki
87. OntoWiki use case: Catalogus Professorum
Linked Open Data 55Ali Khalili
Linked Open Data Lifecycle Authoring Semantic Data Wiki
The Catalogus Professorum Lipsiensis – Semantics-based Collaboration and Exploration for Historians
88. OntoWiki use case: Catalogus Professorum
Linked Open Data 55Ali Khalili
Linked Open Data Lifecycle Authoring Semantic Data Wiki
The Catalogus Professorum Lipsiensis – Semantics-based Collaboration and Exploration for Historians
91. Semantic Content Annotation
Linked Open Data 56Ali Khalili
Linked Open Data Lifecycle Authoring
WYSIWY - What You See Is What You Mean
http://rdface.aksw.org
M
94. Interlinking
Linked Open Data 59Ali Khalili
Linked Open Data Lifecycle
• The degree to which entities that represent the same
concepts are linked to each other.
• “Connecting things that are somehow related”
• Methods
• Automatic, Semi-automatic, Manual
• Universal, Domain-specific
<http://dbpedia.org/resource/VU_University_Amsterdam>
<https://www.wikidata.org/entity/Q1065414>
SameAs
95. Interlinking Methods
Linked Open Data 60Ali Khalili
Linked Open Data Lifecycle
• Ontology Matching
• establish links between ontologies underlying two
data sources.
• Instance Matching (Link Discovery)
• discover links between instances contained in two
data sources.
96. SILK Framework
Linked Open Data 61Ali Khalili
Linked Open Data Lifecycle
http://silk-framework.com/
Interlinking Semi-automatic
97. LIMES Framework
Linked Open Data 62Ali Khalili
Linked Open Data Lifecycle
http://aksw.org/Projects/LIMES
Interlinking Semi-automatic
98. Hands-on
Linked Open Data 63Ali Khalili
Linked Open Data Lifecycle Interlinking
1. Universities in CWTS Leiden Ranking or Times World
University Rankings
2. Universities in H2020 EU projects
Reconcile the following resources against DBpedia:
106. EvoPat – Pattern based KB Evolution
Linked Open Data 71Ali Khalili
Linked Open Data Lifecycle Evolution
• Inspired by Software Refactoring
• Agile Knowledge Engineering
• Basic & compound evolution patterns
http://link.springer.com/chapter/10.1007%2F978-3-642-17746-0_41
107. Linked Open Data 72Ali Khalili
Linked Open Data
Interlinking
Enrichment
Quality
Analysis
Evolution
Exploration
Extraction
Storage/
Querying
Authoring
Linked (Open) Data
Lifecycle
http://stack.linkeddata.org/
Life Cycle
111. References
Linked Open Data 75Ali Khalili
Linked Open Data
• http://slidewiki.org/deck/11936_semantic-data-web-lecture-series
• Introduction to linked data and its lifecycle on the web
• http://euclid-project.eu/
• http://videolectures.net/wims2011_auer_interlinked/
• https://vimeo.com/76257120
• http://www.slideshare.net/slidarko/evolving-the-web-into-a-giant-global-
database-3880018
• http://www.dataversity.net/introduction-to-triplestores/
• http://www.topquadrant.com/2014/05/05/comparing-sparql-with-sql/