Keynote at Online Information 2009, delivered on 3rd December. I discuss hype and reality and focus on linked data as the dominant design for publishing data on the web.
The Reality of Linked Data
1. The Reality of Linked Data. Ian Davis, CTO, Talis. Online Information 2009
2. “ A significant change in the computer field in the last five to eight years has been made in the way we treat and handle data. In the early days of our field, data was intimately tied to the application programs that used it. Now we see that we want to break that tie. We want data that is independent of the application programs that use it – that is, data that is organized and structured to serve many applications and many users. What we seek is the...”
5. “ Copernicus completely reoriented our view of astronomical phenomena when he suggested that the earth revolves around the sun. There is a growing feeling that data processing people would benefit if they were to accept a radically new point of view, one that would liberate the application programmer's thinking from the centralism of core storage and allow him the freedom to act as a navigator within a database.”
6. “Both the software and the hardware needed remain immature; little experience so far existed in its use; and the generalized features offered by the DBMS brought a hefty performance penalty.”
28. Find out more http://www.talis.com/platform http://blogs.talis.com/nodalities [email_address]
Editor's Notes
The title of my talk today is the reality of linked data and I want to show you what is possible today with linked data, who else is using it and how you can get started. But first, I'd like to read this quote that I came across recently
A data base. Two words: data base. This isn't a software system, this is a base of data.
Those words are from Richard G. Canning in his introduction to the 1973 Turing Award. 1973! The sentiment is very familiar today nearly four decades later. The recipient of that year's Turing Award was Charles W. Bachman a pioneer in the field of databases. In his acceptance lecture Bachman compared the change in thinking needed for information systems to that of Copernicus
Bachman was speaking against a background of a decade of hype for database management systems. The technology was seen as a means of enabling everyone in an organisation to have access to information “at their fingertips”. Even senior managers would be using the technological marvel of the database. This myth was brought down to earth in the mid seventies with a series of damning reports.
One stated: In addition, no survey of the early 1970s was able to find any firms where the database was used directly by managers or even by analysts. By 1981 the market-leading database system TOTAL had only 4,000 installations, while IBM's IMS was in second place with around 1,500. But in the same year, in the midst of a severe recession, RSI renamed itself Oracle, Sequoia Capital provided growth investment, and the rest is history. Today even our managers can access the data they need.
This process, this technology adoption process, is well understood these days and is best illustrated by this famous diagram. There is a crucial period as the technology starts up the slope of enlightenment. That's the point at which Oracle got started, after the hype had died away and people started taking a serious look at the reality of the technology. I think this is where we are with the Semantic Web today.
This is also about the time that the industry converges on dominant designs. This is an accepted pattern for a technology, like the pedals in a car. Dominant designs don't stifle innovation but they drive adoption. Massively.
Linked Data is a dominant design for the Semantic Web. It lays down a standard pattern for publishing data so it can be found and reused.
One of the things that Linked Data teaches us is that your website is your API. What does that mean? It means that with a little extra effort to publish data as well as your normal HTML, you can enable people to use your site to build other services and applications. Making your site into an API is simple.
The most important thing you can do also happens to be the simplest. Look at your data and think about what it is about – the places, people and things. Then give each of those things an identifier, a URI, just like you do with your web pages. By assigning URIs to things you enable other people to talk about them. You enable people to link to them.
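The step above can be sketched in a few lines. This is a hypothetical illustration, not a scheme prescribed by the talk: the domain and the `/id/{kind}/{slug}` path pattern are invented for the example.

```python
# A minimal sketch of minting stable URIs for the things a site is about.
# The base domain and the path scheme here are hypothetical examples.

def mint_uri(kind: str, slug: str, base: str = "http://example.org") -> str:
    """Return a URI identifying a thing (not merely a page about it)."""
    return f"{base}/id/{kind}/{slug}"

programme_uri = mint_uri("programme", "doctor-who")
person_uri = mint_uri("person", "ian-davis")
# programme_uri → "http://example.org/id/programme/doctor-who"
```

Once every place, person, and thing has a URI like this, other people can refer to it and link to it exactly as they would link to a web page.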
The next most important thing you can do is to describe those things using RDF. Your descriptions don't have to be sophisticated. Do as much or as little work as you can afford. The better the descriptions are though, the more useful they will be for other people. Including links to other things gives your description context.
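A description of a thing boils down to a set of triples about its URI. The sketch below serializes a few triples by hand to show the shape of a Turtle description; the vocabulary URIs are illustrative, and a real project would use established vocabularies and an RDF library rather than string formatting.

```python
# A minimal sketch: describing a thing as RDF triples, serialized as Turtle.
# Predicates and URIs are illustrative examples only.

def to_turtle(subject: str, triples: list[tuple[str, str]]) -> str:
    """Serialize (predicate, object) pairs about one subject as Turtle."""
    body = " ;\n".join(f"    {pred} {obj}" for pred, obj in triples)
    return f"<{subject}>\n{body} .\n"

doc = to_turtle(
    "http://example.org/id/programme/doctor-who",
    [
        ("a", "<http://purl.org/ontology/po/Programme>"),
        ("<http://purl.org/dc/terms/title>", '"Doctor Who"'),
        # A link to another thing gives the description context:
        ("<http://purl.org/dc/terms/subject>",
         "<http://dbpedia.org/resource/Time_travel>"),
    ],
)
```

Even a description this small is useful: the title makes it readable, and the subject link connects it into the wider web of data.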
Finally you should respond to requests on your identifiers by sending your description of that thing. You can just serve the plain old RDF, or to be more helpful you can provide HTML versions of the descriptions too. If you use RDFa then you can do both in a single document.
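The "respond with a description" step is ordinary HTTP content negotiation. The sketch below is a deliberately simplified decision function: a real server would parse quality values in the Accept header, and the media types checked are just the common RDF ones.

```python
# A simplified sketch of content negotiation for a thing's URI:
# serve RDF to data clients, HTML (possibly with embedded RDFa) to browsers.
# Real servers would honour q-values; this only checks for RDF media types.

def choose_representation(accept_header: str) -> str:
    accept = accept_header.lower()
    if "text/turtle" in accept or "application/rdf+xml" in accept:
        return "rdf"
    # HTML with RDFa embedded can serve both audiences in one document.
    return "html"

data_client = choose_representation("text/turtle")
browser = choose_representation("text/html,application/xhtml+xml")
```

A single RDFa page sidesteps the branch entirely, which is why the notes above mention it as a way to do both in one document.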
With these three steps you have turned your website into an API. In fact it's the best kind of API, because its users don't need any special software to use it. They also don't need to learn a new API for every site they want to use. This talk is about the reality of linked data, not the hype. So which real companies and organisations are doing this today?
The BBC for one. They are publishing their programme catalogue as linked data. And they don't compromise on style or usability.
The data for all these BBC programmes is right there behind the page. Every programme has an identifier, a URI. Every segment of a programme, every brand, every person. In fact, all the important things in the BBC data have URIs.
When you turn your website into an API using linked data, you find that people start building new things that reuse your data in new and interesting ways. This is fanhu.bz, a prototype service that uses linked data from the BBC programmes pages and remixes it with Twitter to build a social space for fans of BBC programmes.
The BBC also expose linked data for their music site. Interestingly, this site reuses linked data from two other sources: DBpedia and MusicBrainz.
This is LIBRIS, the Swedish union catalogue, publishing linked data in exactly the same way.
Here is the UK government doing exactly the same, this time with education data.
The Library of Congress Subject Headings
The New York Times name subject headings. Incidentally, the New York Times have a wonderful metaphor to describe their linked data: they call it their treasure map.
All the sites I have shown so far have been read-only. But you can use linked data for fully interactive web apps too. This is Talis Aspire, one of our products, used by the University of Plymouth. This is a reading list for a module in a mathematics course. All of this is, of course, available as linked data. Because it is also an API the university can reuse this data in lots of different contexts with very little effort.
But this is a powerful interactive application with full editing capabilities. Talis Aspire allows teaching staff to build reading lists using a simple bookmarklet that detects the page being viewed and saves it to a reading list.
Today, to obtain the metadata for that journal, we have to screen scrape the page to look for text that looks like a DOI (if we are lucky). That is then looked up in a separate repository. Just think how much simpler and less error-prone it would be if the publisher's website were its API. It could be, if they just published linked data.
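The fragility of that scraping step is easy to see in code. The pattern below is a common approximation of DOI syntax, not the full specification, and the page text is an invented example:

```python
import re

# Screen scraping for DOIs, as described above: fragile pattern matching
# against page text. This regex is a common approximation, not a full DOI spec,
# and it simply hopes the page happens to contain a DOI-shaped string.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/[-._;()/:a-zA-Z0-9]+")

page_text = "Cited as doi:10.1000/example.123 in the references."
match = DOI_PATTERN.search(page_text)
doi = match.group(0) if match else None
# doi → "10.1000/example.123"
# With linked data, the metadata would be served directly, not guessed at.
```

Everything here depends on the page's incidental formatting; a linked-data publisher would instead serve the metadata itself at a stable URI.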
So what I have shown you is the reality of linked data. Forget the hype and don't be disillusioned. You can be productive today and turn your website into your API.
Remember to identify the important things with URIs, describe them using RDF and respond with those descriptions when people request your identifiers.