Introducing
Open
Data
Ian Henshaw
Registered ODI Trainer
ian.henshaw@gmail.com
@ihenshaw
Overview
●
Defining open data
●
Benefits
●
Open, closed and personal data
●
Why now?
Aims
●
What is open data, its benefits and
opportunities
●
Case studies of successful open data
projects
●
How to discover and use open data?
Learning Objectives
●
Describe open data, Explain the difference
between big, personal and open data
●
Describe the benefits, of working with open data
●
Describe successful case studies of using and
publishing open data
●
Explain how open data sits in the current
economic climate
What is Data?
Exercise
Definition of Data (1)
A collection of facts, information
and statistics that can be analyzed
to develop new knowledge
Definition of Data (2)
A collection of numbers assigned as
values to quantitative variables
and/or characters assigned as
values to qualitative variables
Definition of Data (3)
The lowest level of abstraction
from which information and then
knowledge are derived.
The DIKW Pyramid
Data
Information
Knowledge
Wisdom
Link
Elements
Organize
Information
Apply
Knowledge
What is Open Data?
Exercise
Definition of Open (OKF)
A piece of data or content is open if
anyone is free to use, reuse, and
redistribute it - subject only, at most,
to the requirement to attribute and/or
share-alike.
Definition of Open (ODI)
Open data is data that is made
available by organizations, businesses
and individuals for anyone to access,
use and share.
●
Open data has to have a license that says it is open data. Without a
license, the data can’t be reused. The license might also say:
– that people who use the data must credit whoever is publishing it (this
is called attribution)
– that people who mix the data with other data have to also release the
results as open data (this is called share-alike)
Definition of Open Data
Open Data is accessible public
data that people, companies
and organizations can use to
launch new ventures, analyze
patterns and trends, make
data-driven decisions, and
solve complex problems.
http://www.opendatanow.com/book-open-data-now/
ISBN-10: 0071829776
ISBN-13: 978-0071829779
opendefinition.org
Access
Redistribution
Reuse
Integrity
Attribution
Non-discriminatory
Remix and
Combine
“Data is the new Oil” - Clive Humby
kenhodge13 (40132991@N07) on flickr.com
“Data is just like crude.” - Michael Palmer
●
It’s valuable, but if unrefined it cannot really be
used.
●
It has to be changed into gas, plastic,
chemicals, etc to create a valuable entity that
drives profitable activity
Data must be broken down and
analyzed for it to have value.
Data is the raw material of the
new industrial revolution
- Francis Maude MP
Justifications
Trust and
Transparency
Enabling the
Economy
Benefits of Open Data
●
Data Driven Decision Making
●
Performance Measurement
●
Reduction of Government Costs
●
Support an Open Government Initiative
– e.g. Transparency
●
Economic Development
●
Increased Citizen Engagement
●
Talent Attraction / Retention
Data Driven Decision Making
http://london-
fire.labs.theodi.org/explore/
Performance Measurement
https://data.maryland.gov/
goals/energy-efficiency
Reduction of Government Costs
http://theodi.org/news/prescription-savings-worth-millions-identified-odi-incubated-company
Reduction of Government Costs
http://www.bloomberg.com/
news/2014-04-09/top-
medicare-doctor-paid-21-
million-in-2012-data-
shows.html
Support for Open Government
http://cms.gov/openpayments/
Support for Open Government
http://cms.gov/openpayments/
Support for Open Government
http://www.openbudget.ny.gov/
Economic Development
http://www.mckinsey.com/insights/high_tech_telecoms_internet/what_executives_should_know_about_open_data
Report:
January 2014
Economic Development
http://www.gpsalliance.org/docs/GPS_Report_June_21_2011.pdf
The Economic Benefits of Commercial GPS
Use in the United States and the Costs of
Potential Disruption
“GPS technology will create $122.4 billion
benefits per year and will directly affect more
than 5.8 million jobs in the downstream
commercial GPS-intensive industries when
penetration of GPS technology reaches 100
percent in the commercial GPS-intensive
industries.”
Increased Citizen Engagement
Talent Attraction / Retention
Presentation to the world class city partnership
annual conference in Barcelona focusing on Dublin
and talent attraction by Jamie Cudden
http://www.slideshare.net/jcudden/dublin-talent-
presentation-23-06-12-wccp
http://www.indychamber.com/news/indy-chamber-
news/hacking-talent-pool/
NC Cities: Open Data Portals
http://technologytank.org/2015/02/26/open-data-portals-in-nc/
Examples of Open Data
● Government Data ● Commercial Data
Examples of Open Data
● Crowd Sourced Data
GODAN Report
Global Open Data for
Agriculture and Nutrition
initiative: Open Data can help
solve these problems:
●
Enabling more efficient and
effective decision making
●
Fostering innovation that
everyone can benefit from
●
Driving organizational and
sector change through
transparency
http://www.godan.info/wp-content/uploads/2015/04/ODI-GODAN-paper-27-05-20152.pdf
GODAN Use Cases
Enabling more efficient and effective decision making
●
Protecting crops from pest outbreaks with vegetation maps:
GroenMonitor
●
Helping farmers forecast with weather apps and SMS: Awhere
●
Boosting crop yields with a best practice knowledge bank:
Plantwise
●
Saving $3.6m in drought damage with a climate-smart tool:
CIAT Colombia
●
Managing the California drought with data visualizations:
California Department of Water Resources
GODAN Use Cases
Fostering innovation to benefit everyone
●
Saving crops and cash with weather simulation and
smart insurance: Climate Corporation
●
Improving crop varieties with open data on breeding
trials: AgTrials
●
Bringing agricultural research to the masses: FAO
AGRIS portal AGRIS
●
Making agri-food data more discoverable: the
CIARD RING
GODAN Use Cases
Driving organizational and sector change through transparency
●
Tracking water, pesticide, water and fuel use with an open,
collaborative platform: Syngenta
●
Exposing misspent farm subsidies in Mexico: FUNDAR
●
Empowering consumers to make smart food choices: US
national nutrient database
●
Helping consumers understand risks of the food they eat: EU
Food alerts
●
Highlighting restaurant inspection scores and improving food
safety: LIVES
Syngenta Good Growth Plan
http://www.syngenta.com/global/corporate/en/goodgrowthplan/Documents/pdf/The
%20Good%20Growth%20Plan_%2016pp%20brochure%20ENG.pdf
Activity
●
Harris Teeter
●
Lenovo
●
Avon
●
JC Penney
●
Proctor & Gamble
●
Exxon / Mobil
●
What opportunities can you identify with open
data?
– What data can you give?
– What data do you need?
Challenges and Risks
http://theodi.github.io/data-definitions/
Types of personal data
Open personal data
Data about people
not a person
Available to anyone
Has been anonymised
e.g. number of people attending
event, gender split, age ranges.
(bigger numbers are better!)
Available personal
data
Data about a person
Available to the
person only!
Often known as
MiData
e.g. credit scores, energy and
other consumption data.
Personal data
Data about a person
which is neither
open nor available.
Might belong to you
or be collected by a
company.
Opportunities
●
Open Data
– Brings transparency, open peer review
●
Big Data
– Brings the evidence
●
Personal Data
– Makes it relevant
Opportunity
http://theodi.github.io/data-definitions/
Discovering Data
●
The idea of this exercise is to help you gain a
better understanding and be able to critically
appraise the different types of open data
publication practice on the web.
●
http://training.theodi.org/resources/Discoveri
ng_Open_Data_Exercise.pdf
Why Now?
A Global Movement
http://theodi.github.io/open-data-barometer-viz/
Knowledge
for
everyone

Open Data in a Day - Introduction to Open Data