This document discusses open data and the 5 star open data model. It describes each level of the 5 star model from 1 to 5 stars. 1 star data is available on the web with an open license. 2 star data is structured. 3 star data uses open formats. 4 star data uses URLs to identify things. 5 star data links to other open datasets. The document provides several examples of open data and advocates for making data more open and accessible on the web.
1. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Dr. Sabin Buraga
Faculty of Computer Science, “A. I. Cuza” of Iasi, Romania
www.purl.org/net/busaco @busaco4web
2. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
open participation
open data
open software
open app development
open web
open cloud
open (computing) hardware
3. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
World Wide Web = “a common information space
in which we communicate by sharing information”
Tim Berners-Lee (2013)
4. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Internet
(Web)
Client Web application Storage
(user interface) server/framework (data persistence)
5. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
URL – Uniform Resource Identifier
addressability
Internet
(Web)
Client Web application Storage
(user interface) server/framework (data persistence)
for example: http://www.slideshare.net/busaco/presentations/
6. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
HTTP – HyperText Transfer Protocol
access to resources
Internet
(Web)
Client Web application Storage
(user interface) server/framework (data persistence)
a browser asks a Web server to provide a resource representation
7. HTML (HyperText Markup Language), JSON, PDF, PNG,…
Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
representation(s) of a resource
Internet
(Web)
Client Web application Storage
(user interface) server/framework (data persistence)
a Web page includes URLs to other resourceshypermedia
8. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Reusing & sharing data available on the Web
data access via a Web service
usually, by using an API
(Application Programming Interface)
9. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Web servicespublic APIsmash-ups
www.programmableweb.com
10. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
open data
“A piece of content or data is open
if anyone is free to use, reuse, and redistribute it.”
http://opendefinition.org/
11. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
>
“If you have access to the data,
then you can achieve continuity
even if you don’t have access to
the underlying source of the application.
Open data is more important than open source. […]
Data persists, open data endures.”
Ian Davis, 2009
13. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Reusing data available on the Web
necessity of adopting a (re)use license
14. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Reusing data available on the Web
necessity of adopting a (re)use license
fair use
public domain
copyleft
15. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Reusing data available on the Web
necessity of adopting a (re)use license
Creative Commons
17. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Data availability
on the Web
as “opaque” document
(usually, using a proprietary format)
does not refer – via current Web technologies –
other resources of interest
Tom Health (2007)
18. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Data availability
in the Web
assuring discoverability via hypermedia
uses open data models/formats
(e.g., HTML, XML, JSON, CSV, RDF etc.)
platform independent
Tom Health (2007)
19. Can we evaluate the data openness?
Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
20. 5 ★ Open Data
Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Tim Berners-Lee (2009)
21. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
1-star data
the content is available on the Web – by using any
format – according to an open license
http://opendefinition.org/licenses/
22. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
users can view, print, locally store,
and – eventually – modify the document
the document itself can be shared on the Internet
23. a PDF containing a scanned image ☹
Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
24. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
the document could be easily published on the Web
in order to reuse the data kept into the document,
additional processing might be necessary
25. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
2-star data
additionally, the content must be available
as structured data (e.g., relations between entities)
26. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
users can process the document by using, in most cases,
a proprietary software application
the document can be exported
into another (structured) format
27. a proprietary format
containing structured data ☹
Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
28. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
the document can be easily published on the Web
data is still “locked” into the document +
processing is depending by a specific application
29. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
3-star open data
using an open (non-proprietary) format
to make data available on the Web
30. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
same content as HTML document ☺
<section>
<p>10:15 – 11:00</p>
<p>Towards 5-Star Data in the E-university</p>
<p>Presenter: Sabin Buraga</p>
</section>
31. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
data can be managed (viewed, processed, filtered,
converted, shared, reused, etc.) in any manner
important aspect: platform independence
32. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
the document is still rather simple to be published on Web
exporting data into a proprietary format
could be problematic
33. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
4-star open data
each “thing” (entity) of interest from the document
is denoted by a Web address – URL
34. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
data, information, and knowledge are identified via URLs
in order to be accessed and (re)used
RDF model (Resource Description Framework)
W3C standards (1998, 2004, 2014)
www.w3.org/standards/semanticweb/
36. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
content publishing could be much difficult, employing
the adoption of the semantic Web – or Web of Data –
technologies, tools, and methodologies
data in the Weblong term implications
37. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
5-star open data
additionally, data is inter-connected to other
datasets, according to the linked data initiative
http://linkeddata.org/
38. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
inter-connecting open datasets ☺
http://lod4all.net/
39. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
possibility to discover other (related) data of interest
while consuming the datanetwork effect
other advantage: Web-based automatic reasoning
40. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
difficulties:
assuring data/knowledge consistency
problems related to slow adoption
42. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
★
make your stuff available on the Web
(whatever format) under an open license
★★
make it available as structured data
e.g., Excel instead of image scan of a table
★★★
use non-proprietary formats
e.g., CSV (Comma Separated Values) instead of Excel
★★★★
use Web addresses (URLs) to denote things,
so that people can point at your stuff
★★★★★
link your data to other data – see http://datahub.io/ –
to provide context
Ed Summers (2010)
43. Several real-life examples
(in the academic context)?
Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
45. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
freely available knowledge bases: DBpedia & Freebase
http://en.lodlive.it/
46. open e-science – see myexperiment.org
Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
47. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
open e-government
access to official data according to the openness score
http://data.gov.uk/data/search?openness_score=5
48. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
promoting open data & open software
student workshops & contests co-organized by
Faculty of Computer Science – UAIC Romania
Design Jam Iasi (3 editions), Firefox OS App Day, Firefox
OS Hackathon, Open Source Iasi, Winter Web Workshop
and many others
49. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
open access to various student projects & initiatives
Faculty of Computer Science – UAIC Romania
http://profs.info.uaic.ro/~stefan.negru/studentprojects/
50. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
open access to various
educational resources
Faculty of Computer Science
UAIC Romania
http://profs.info.uaic.ro/~busaco/teach/
51. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
“Software – as a service or not – is just a container.
What makes software valuable has always been what
it does to data. Now, in the same spirit of SOA (Service
Oriented Architecture) and SaaS (Software As A Service),
a new concept is emerging, Data-as-a-Service – DaaS.”
Pete Soderling (2010)
52. Dr. Sabin-Corneliu Buraga – www.purl.org/net/busaco
Dr. Sabin Buraga
Faculty of Computer Science, “A. I. Cuza” of Iasi, Romania
www.purl.org/net/busaco @busaco4web