Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Dr. Sabin Buraga
Faculty of Computer Science, UAIC Iasi, Romania
profs.info.uaic.ro/~busaco/  slideshare.net/busaco
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
open participation
open data
open software
open app development
open web
open cloud
open (computing) hardware





⛈
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/

World Wide Web = “a common information space
in which we communicate by sharing information”
Tim Berners-Lee (2013)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
URL – Uniform Resource Identifier
addressability
for example: http://www.slideshare.net/busaco/presentations/
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
HTTP – HyperText Transfer Protocol
access to resources
a browser asks a Web server to provide a resource representation
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
HTML, JSON, PDF, PNG, SVG,…
representation(s) of a resource
a Web page includes URLs to other resourceshypermedia
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Reusing & sharing data available on the Web
data access via a Web service
usually, by using an API
(Application Programming Interface)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Web servicespublic APIsmash-ups
www.programmableweb.com
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
APIs could be described via an open format
(see OpenAPI specifications): http://theapistack.com/
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
aging…
James Governor (2007)
software ≈ fishdata ≈ wine
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/

open data
“A piece of content or data is open
if anyone is free to use, reuse, and redistribute it.”
http://opendefinition.org/
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
 > 
“If you have access to the data,
then you can achieve continuity
even if you don’t have access to
the underlying source of the application.
Open data is more important than open source. […]
Data persists, open data endures.”
Ian Davis, 2009
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
legal/technical openness
availability & access
reusing & sharing
universal participation

inter-operability
opendatahandbook.org
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Reusing data available on the Web
necessity of adopting a (re)use license
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Reusing data available on the Web
necessity of adopting a (re)use license
fair use
public domain
copyleft
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Reusing data available on the Web
necessity of adopting a (re)use license
Creative Commons
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
openness, transparency, respect
https://creativecommons.org/
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Data availability
on the Web
as “opaque” document
(usually, using a proprietary format)
does not refer – via current Web technologies –
other resources of interest
Tom Health (2007)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Data availability
in the Web
assuring discoverability via hypermedia
uses open data models/formats
(e.g., HTML, XML, JSON, CSV, RDF etc.)
platform independent
Tom Health (2007)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Can we evaluate the data openness?
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
5 ★ Open Data
Tim Berners-Lee (2009)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
1-star data
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
1-star data
the content is available on the Web – by using any
format – according to an open license
http://opendefinition.org/licenses/
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
users can view, print, locally store,
and – eventually – modify the document
the document itself can be shared on the Internet
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
a PDF containing a scanned image ☹
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
the document could be easily published on the Web
in order to reuse the data kept into the document,
additional processing might be necessary
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
2-star data
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
2-star data
additionally, the content must be available
as structured data (e.g., relations between entities)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
users can process the document by using, in most cases,
a proprietary software application
the document can be exported
into another (structured) format
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
a proprietary format
containing structured data ☹
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
the document can be easily published on the Web
data is still “locked” into the document +
processing is depending by a specific application
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
3-star open data
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
3-star open data
using an open (non-proprietary) format
to make data available on the Web
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
same content as HTML5 document ☺
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
<section class="timeslot" lang="en">
<div class="timeslot-label">
<time class="start-time"
datetime="20160508T11:45">
11<span>45</span>
</time>
<time class="end-time"
datetime="20160508T12:45">
12<span>45</span>
</time>
</div>
<p class="title">Why 5-Star Data?</p>
<p class="speaker">Sabin-Corneliu Buraga</p>
</section>
denoting a certain meaning from
the document’s author point of view
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
data can be managed (viewed, processed, filtered,
converted, shared, reused, etc.) in any manner
important aspect: platform independence
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
the document is still rather simple to be published on Web
exporting data into a proprietary format
could be problematic
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
4-star open data
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
4-star open data
each “thing” (entity) of interest from the document
is denoted by a Web address – URL
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
data, information, and knowledge are identified via URLs
in order to be accessed and (re)used
RDF (Resource Description Framework) model
W3C standards
www.w3.org/standards/semanticweb/
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
machine-friendly RDF assertions ☺
<!-- the thing identified by
‘busaco’ is a person -->
<div resource="#busaco"
typeof="foaf:Person">
<a property="url" href="...">
<span property="name">
Sabin Buraga</span>
</a>
</div>
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
machine-friendly RDF assertions ☺
towards classes of things:
presentations
persons
organizations
...
things, not strings
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
content publishing could be much difficult, employing
the adoption of the semantic Web – or Web of Data –
technologies, tools, and methodologies
data in the Weblong term implications
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
5-star open data
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
5-star open data
additionally, data is inter-connected to other
datasets, according to the linked data initiative
linkeddata.org
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
inter-connecting open datasets ☺
graphofthings.org
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
possibility to discover other (related) data of interest
while consuming the datanetwork effect
other advantage: Web-based automatic reasoning
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
difficulties:
assuring data/knowledge consistency
problems related to slow adoption
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
5stardata.info
Michael Hausenblas (2012)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
★
make your stuff available on the Web
(whatever format) under an open license
★★
make it available as structured data
e.g., Excel instead of image scan of a table
★★★
use non-proprietary formats
e.g., CSV (Comma Separated Values) instead of Excel
★★★★
use Web addresses (URLs) to denote things,
so that people can point at your stuff
★★★★★
link your data to other data – see http://datahub.io/ –
to provide context
Ed Summers (2010)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Several real-life examples?
Dr.SabinBuragawww.purl.org/net/busaco
augmenting the current Web search activities
via HTML5 schema.org + RDFa rdfa.info
Dr.SabinBuragawww.purl.org/net/busaco
access to public datasets ☺
Dr.SabinBuragawww.purl.org/net/busaco
Academic Torrents
http://academictorrents.com/
Awesome Public Datasets
https://github.com/caesar0301/awesome-public-datasets
Awesome JSON Datasets
https://github.com/jdorfman/awesome-json-datasets
Common Crawl
http://commoncrawl.org/the-data/
DataHub
https://datahub.io/dataset
Dr.SabinBuragawww.purl.org/net/busaco
DBpedia.org
a crowd-sourced
community effort
to extract structured
information from Wikipedia
in order to be
“intelligently” processed
by software
Dr.SabinBuragawww.purl.org/net/busaco
Wikidata.org – a free knowledge base that can be read
and edited by both humans & machines
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
open e-government: visualizing + comparing quality indicators
(license, formats, availability, metadata) regarding open datasets
opendatamonitor.eu
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
“Software – as a service or not – is just a container.
What makes software valuable has always been what
it does to data. Now, in the same spirit of SOA (Service
Oriented Architecture) and SaaS (Software As A Service),
a new concept is emerging, Data-as-a-Service – DaaS.”
Pete Soderling (2010)
Dr.Sabin-CorneliuBuraga–http://profs.info.uaic.ro/~busaco/
Dr. Sabin Buraga
Faculty of Computer Science, UAIC Iasi, Romania
profs.info.uaic.ro/~busaco/  slideshare.net/busaco

Why 5-Star Data?