SlideShare a Scribd company logo
1 of 22
Download to read offline
Digital Cholera
Peter Wells - @peterkwells
OpenTech - June 2015
Cholera
The last time I was in this building I went to
a talk on an early example of data analysis
and data visualisation.
John Snow famously traced a fatal cholera
epidemic in Soho in 1854 to a local water
pump.
Because of cholera in the pump the water
was not safe to use.
Read more about John Snow: http://en.wikipedia.org/wiki/John_Snow_%28physician%29
@peterkwells
Cholera and infrastructure
The Soho outbreak started at a
water pump, it could have been a
water reservoir.
The cholera bacteria would
spread and contaminate the water
downstream. An entire set of
water infrastructure could have
been contaminated.
The water would not have been
safe to use. Yet water is essential
to life.
Image CC-BY-2.0 by Woodley Wonderworks: https://www.flickr.com/photos/wwworks/
@peterkwells
Safe water
As a society we invest in water
infrastructure. We have:
- inspections
- alerting systems
- purification
- education
We put more focus at the top of the
infrastructure, on water producers and
distributors, than we do on water users.
The goal is to make water that’s safe
for people to use.
A Doctor from the World Health Organisation
@peterkwells
Get to the digital...
@peterkwells
Open Addresses
Organisations have to buy lists of UK addresses, licensing is complicated, the
quality isn’t great, the data doesn’t meet all the needs.
It’s hard to build new services.
Open Addresses explored whether it was possible to build a new UK address list,
to make things simpler and make addresses more widely used.
@peterkwells
Addressing needs
Denmark had a 1000% increase in the organisations that use address data by
making address data simpler to use.
We discovered other needs and benefits:
- people who move into new houses need their addresses to be published faster
- people name their houses and need other people to know about it
- people need it to be easier to enter addresses on websites
- (I could go on…)
@peterkwells
More and better services that would make life
a little bit easier
Getting addresses
As well as understanding the needs we had to find data.
There are 26-40m addresses in the UK.
The Land Registry publishes over 18 million addresses in the Price Paid Dataset.
Sounds great!
@peterkwells
Aside: we also did some neat stuff on mathematical inference for addresses.
Check out www.openaddressesuk.org...
Land Registry says no...
Image from Owen Boswarva: http://mapgubbins.tumblr.com/post/107499166390/it-was-all-a-dream-land-registrys-price-paid
@peterkwells
Third Party Rights are
complex and can be fatal
Address datasets can include third-party database rights:
1. if the data was directly copied from an existing address database
2. if an existing list of addresses (obtained through another route) was corrected or
validated based on an existing address database
Unauthorised use of third party rights creates risk for both data publishers and
consumers.
The service can simply…... stop.
@peterkwells
Third party rights, they’re
everywhere!
As we inspected other datasets we saw similar issues with unauthorised rights:
- websites for data capture that used third party address products
- datasets that had been cleansed with third party address products
- a clean website followed by automated back-end validation
Even with submission guidelines, provenance tracking and takedown policies the legal
position for Open Addresses was really complex.
We made a :(
@peterkwells
Lightbulb
It is complicated to determine if unauthorised third party rights
exist. You need to inspect the data and how it was produced
@peterkwells
Image by Richard Rutter: https://www.
flickr.com/photos/clagnut/
Safe water - a reprise
As a society we invest in water
infrastructure:
- inspections
- alerting systems
- purification
- education
We put more focus at the top of the
infrastructure, on water producers and
distributors, than we do on water users.
The goal is to make water that’s safe
for people to use.
Image CC-BY-2.0 by Woodley Wonderworks: https://www.flickr.com/photos/wwworks/
@peterkwells
A Doctor from the World Health Organisation
Digital cholera
@peterkwells
Copyright is a good thing (don’t believe me? ask a musician) so I’m using a harsh metaphor, but
the metaphor is useful.
Don’t take away
my copyright!
Digital cholera
@peterkwells
The water may be infected with
cholera.
Therefore we inspect it to see if
the water is safe to use.
Land Registry address data may
be infected with digital cholera.
Therefore we inspect it to see if
the data is safe to use.
We learnt it wasn’t so we didn’t….
Digital cholera
@peterkwells
Not just about unauthorised third party rights.
Inappropriate releases of personal data.
Incomplete data.
Incorrect data.
Remember it’s a metaphor.
Digital cholera
@peterkwells
Can we learn more from how society learnt to deal with cholera in water?
Alerting system?
@peterkwells
We’ve told Land Registry of the problem(s).
We’ve published articles to alert others.
We’re here.
Should this be better?
Purification?
@peterkwells
Tricky. There is no equivalent of a purification tablet.
We need to cleanse data infrastructure of digital cholera or we need to rebuild it.
It is simplest if the data is kept pure by whoever creates and maintains it.
Just as with water.
Education
@peterkwells
The ODI already have a wealth of education material and are including the thinking and
learning from Open Addresses in some future work:
Send your ideas more here:http://theodi.org/who-owns-our-data-infrastructure?
Water is essential to life so we invest in
maintaining our water infrastructure to make
water safe to use.
Data gives us more and better services. It is is
essential to life. We need to invest in
maintaining useful data infrastructure to make
data safe to use.
@peterkwells
@peterkwellsImage by Don Graham: https://www.flickr.com/photos/23155134@N06/
If we don’t look after our
data infrastructure we risk
simply ending up with
some rusty and unused
data pumps….

More Related Content

Similar to Open tech digital cholera

Essay On Picture Composition In Hindi
Essay On Picture Composition In HindiEssay On Picture Composition In Hindi
Essay On Picture Composition In HindiEmily Garcia
 
Writing An Essay Plan. Online assignment writing service.
Writing An Essay Plan. Online assignment writing service.Writing An Essay Plan. Online assignment writing service.
Writing An Essay Plan. Online assignment writing service.Katie Parker
 
Social Bar - Open Data
Social Bar - Open DataSocial Bar - Open Data
Social Bar - Open DataEdial Dekker
 
7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docx
7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docx7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docx
7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docxsleeperharwell
 
Artificial Intelligence in Biodiversity and Citizen Science
Artificial Intelligence in Biodiversity and Citizen ScienceArtificial Intelligence in Biodiversity and Citizen Science
Artificial Intelligence in Biodiversity and Citizen ScienceKatina Michael
 
Fictional Narrative Essay Worksheets
Fictional Narrative Essay WorksheetsFictional Narrative Essay Worksheets
Fictional Narrative Essay WorksheetsHeather Lopez
 
Essay On My Favourite Sportsman Sachin Tendulkar
Essay On My Favourite Sportsman Sachin TendulkarEssay On My Favourite Sportsman Sachin Tendulkar
Essay On My Favourite Sportsman Sachin TendulkarKris Hallengren
 
Smart Tap
Smart TapSmart Tap
Smart Tapluckd73
 
Water Supply In California
Water Supply In CaliforniaWater Supply In California
Water Supply In CaliforniaDawn Mora
 
Essay On My Native Town Kathmandu
Essay On My Native Town KathmanduEssay On My Native Town Kathmandu
Essay On My Native Town KathmanduBrittany Koch
 
Buy Your College Essay - Buy College Essay
Buy Your College Essay - Buy College EssayBuy Your College Essay - Buy College Essay
Buy Your College Essay - Buy College EssayTania Knapp
 
Writing Template With Drawing Box. Online assignment writing service.
Writing Template With Drawing Box. Online assignment writing service.Writing Template With Drawing Box. Online assignment writing service.
Writing Template With Drawing Box. Online assignment writing service.Jeanne Hall
 
Preparing for the Impact of Web 3.0
Preparing for the Impact of Web 3.0 Preparing for the Impact of Web 3.0
Preparing for the Impact of Web 3.0 Judy O'Connell
 
Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach3 Round Stones
 
Ap Euro Practice Essay Questions. Online assignment writing service.
Ap Euro Practice Essay Questions. Online assignment writing service.Ap Euro Practice Essay Questions. Online assignment writing service.
Ap Euro Practice Essay Questions. Online assignment writing service.Nicole Barnes
 

Similar to Open tech digital cholera (20)

Kiss Of Love Essay
Kiss Of Love EssayKiss Of Love Essay
Kiss Of Love Essay
 
Essay On Picture Composition In Hindi
Essay On Picture Composition In HindiEssay On Picture Composition In Hindi
Essay On Picture Composition In Hindi
 
The Internet of Things and what it mean for librarians
The Internet of Things and what it mean for librariansThe Internet of Things and what it mean for librarians
The Internet of Things and what it mean for librarians
 
Cool Tools
Cool Tools Cool Tools
Cool Tools
 
Essay Bus Accident
Essay Bus AccidentEssay Bus Accident
Essay Bus Accident
 
Writing An Essay Plan. Online assignment writing service.
Writing An Essay Plan. Online assignment writing service.Writing An Essay Plan. Online assignment writing service.
Writing An Essay Plan. Online assignment writing service.
 
Social Bar - Open Data
Social Bar - Open DataSocial Bar - Open Data
Social Bar - Open Data
 
7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docx
7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docx7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docx
7.1 Evaluating Information7.2 Neo-Luddite Views of Compute.docx
 
Artificial Intelligence in Biodiversity and Citizen Science
Artificial Intelligence in Biodiversity and Citizen ScienceArtificial Intelligence in Biodiversity and Citizen Science
Artificial Intelligence in Biodiversity and Citizen Science
 
Fictional Narrative Essay Worksheets
Fictional Narrative Essay WorksheetsFictional Narrative Essay Worksheets
Fictional Narrative Essay Worksheets
 
Essay On My Favourite Sportsman Sachin Tendulkar
Essay On My Favourite Sportsman Sachin TendulkarEssay On My Favourite Sportsman Sachin Tendulkar
Essay On My Favourite Sportsman Sachin Tendulkar
 
Smart Tap
Smart TapSmart Tap
Smart Tap
 
Water Supply In California
Water Supply In CaliforniaWater Supply In California
Water Supply In California
 
Breanna Hitchens E-waste
Breanna Hitchens E-wasteBreanna Hitchens E-waste
Breanna Hitchens E-waste
 
Essay On My Native Town Kathmandu
Essay On My Native Town KathmanduEssay On My Native Town Kathmandu
Essay On My Native Town Kathmandu
 
Buy Your College Essay - Buy College Essay
Buy Your College Essay - Buy College EssayBuy Your College Essay - Buy College Essay
Buy Your College Essay - Buy College Essay
 
Writing Template With Drawing Box. Online assignment writing service.
Writing Template With Drawing Box. Online assignment writing service.Writing Template With Drawing Box. Online assignment writing service.
Writing Template With Drawing Box. Online assignment writing service.
 
Preparing for the Impact of Web 3.0
Preparing for the Impact of Web 3.0 Preparing for the Impact of Web 3.0
Preparing for the Impact of Web 3.0
 
Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach
 
Ap Euro Practice Essay Questions. Online assignment writing service.
Ap Euro Practice Essay Questions. Online assignment writing service.Ap Euro Practice Essay Questions. Online assignment writing service.
Ap Euro Practice Essay Questions. Online assignment writing service.
 

More from Peter Wells

Practical data ethics
Practical data ethicsPractical data ethics
Practical data ethicsPeter Wells
 
#Shared smartcitiesworld data trusts - peter w - 2019-06-18
#Shared smartcitiesworld   data trusts - peter w - 2019-06-18#Shared smartcitiesworld   data trusts - peter w - 2019-06-18
#Shared smartcitiesworld data trusts - peter w - 2019-06-18Peter Wells
 
#Shared data for policy 2019 the different approaches to increasing access ...
#Shared data for policy 2019   the different approaches to increasing access ...#Shared data for policy 2019   the different approaches to increasing access ...
#Shared data for policy 2019 the different approaches to increasing access ...Peter Wells
 
Annual centre for competition policy conference - access to data, and more 20...
Annual centre for competition policy conference - access to data, and more 20...Annual centre for competition policy conference - access to data, and more 20...
Annual centre for competition policy conference - access to data, and more 20...Peter Wells
 
Rss characteristics of good data governance - data trusts - peter w - 2019-...
Rss   characteristics of good data governance - data trusts - peter w - 2019-...Rss   characteristics of good data governance - data trusts - peter w - 2019-...
Rss characteristics of good data governance - data trusts - peter w - 2019-...Peter Wells
 
Open source lab berlin - 2019 - understanding and monitoring city data ecos...
Open source lab   berlin - 2019 - understanding and monitoring city data ecos...Open source lab   berlin - 2019 - understanding and monitoring city data ecos...
Open source lab berlin - 2019 - understanding and monitoring city data ecos...Peter Wells
 
Launch of ODI 2019 data trust pilots work
Launch of ODI 2019 data trust pilots workLaunch of ODI 2019 data trust pilots work
Launch of ODI 2019 data trust pilots workPeter Wells
 
Fil presentation overview 2019-01-24
Fil   presentation overview 2019-01-24Fil   presentation overview 2019-01-24
Fil presentation overview 2019-01-24Peter Wells
 
Alan turing institute workshop what is a data trust - 2018 - peter wells -...
Alan turing institute workshop   what is a data trust  - 2018 - peter wells -...Alan turing institute workshop   what is a data trust  - 2018 - peter wells -...
Alan turing institute workshop what is a data trust - 2018 - peter wells -...Peter Wells
 
Retail week live 2018 gdpr and innovation - peter wells - open data institute
Retail week live 2018   gdpr and innovation - peter wells - open data instituteRetail week live 2018   gdpr and innovation - peter wells - open data institute
Retail week live 2018 gdpr and innovation - peter wells - open data institutePeter Wells
 
data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017
data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017
data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017Peter Wells
 
open data and advocacy - eu datahon november 2017
open data and advocacy - eu datahon november 2017open data and advocacy - eu datahon november 2017
open data and advocacy - eu datahon november 2017Peter Wells
 
Travelspirit 2017 the opportunity of open - peter w presentation (1)
Travelspirit 2017   the opportunity of open - peter w presentation (1)Travelspirit 2017   the opportunity of open - peter w presentation (1)
Travelspirit 2017 the opportunity of open - peter w presentation (1)Peter Wells
 
2017-09-07 using data to create impact- some policy design patterns - data fo...
2017-09-07 using data to create impact- some policy design patterns - data fo...2017-09-07 using data to create impact- some policy design patterns - data fo...
2017-09-07 using data to create impact- some policy design patterns - data fo...Peter Wells
 
Mydata2017 case studies - open banking - peter wells - open data institute
Mydata2017   case studies - open banking - peter wells - open data instituteMydata2017   case studies - open banking - peter wells - open data institute
Mydata2017 case studies - open banking - peter wells - open data institutePeter Wells
 
Mydata2017 ourdata track - mydata ourdata symbiotic relationship - peter we...
Mydata2017   ourdata track - mydata ourdata symbiotic relationship - peter we...Mydata2017   ourdata track - mydata ourdata symbiotic relationship - peter we...
Mydata2017 ourdata track - mydata ourdata symbiotic relationship - peter we...Peter Wells
 
2017 presentation - peter wells - public sector cloud - data infrastructure
2017   presentation - peter wells - public sector cloud - data infrastructure2017   presentation - peter wells - public sector cloud - data infrastructure
2017 presentation - peter wells - public sector cloud - data infrastructurePeter Wells
 
Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017Peter Wells
 
Open your effing data presentation 2017
Open your effing data presentation 2017Open your effing data presentation 2017
Open your effing data presentation 2017Peter Wells
 
Odi fridays 201611 gov cats
Odi fridays 201611 gov catsOdi fridays 201611 gov cats
Odi fridays 201611 gov catsPeter Wells
 

More from Peter Wells (20)

Practical data ethics
Practical data ethicsPractical data ethics
Practical data ethics
 
#Shared smartcitiesworld data trusts - peter w - 2019-06-18
#Shared smartcitiesworld   data trusts - peter w - 2019-06-18#Shared smartcitiesworld   data trusts - peter w - 2019-06-18
#Shared smartcitiesworld data trusts - peter w - 2019-06-18
 
#Shared data for policy 2019 the different approaches to increasing access ...
#Shared data for policy 2019   the different approaches to increasing access ...#Shared data for policy 2019   the different approaches to increasing access ...
#Shared data for policy 2019 the different approaches to increasing access ...
 
Annual centre for competition policy conference - access to data, and more 20...
Annual centre for competition policy conference - access to data, and more 20...Annual centre for competition policy conference - access to data, and more 20...
Annual centre for competition policy conference - access to data, and more 20...
 
Rss characteristics of good data governance - data trusts - peter w - 2019-...
Rss   characteristics of good data governance - data trusts - peter w - 2019-...Rss   characteristics of good data governance - data trusts - peter w - 2019-...
Rss characteristics of good data governance - data trusts - peter w - 2019-...
 
Open source lab berlin - 2019 - understanding and monitoring city data ecos...
Open source lab   berlin - 2019 - understanding and monitoring city data ecos...Open source lab   berlin - 2019 - understanding and monitoring city data ecos...
Open source lab berlin - 2019 - understanding and monitoring city data ecos...
 
Launch of ODI 2019 data trust pilots work
Launch of ODI 2019 data trust pilots workLaunch of ODI 2019 data trust pilots work
Launch of ODI 2019 data trust pilots work
 
Fil presentation overview 2019-01-24
Fil   presentation overview 2019-01-24Fil   presentation overview 2019-01-24
Fil presentation overview 2019-01-24
 
Alan turing institute workshop what is a data trust - 2018 - peter wells -...
Alan turing institute workshop   what is a data trust  - 2018 - peter wells -...Alan turing institute workshop   what is a data trust  - 2018 - peter wells -...
Alan turing institute workshop what is a data trust - 2018 - peter wells -...
 
Retail week live 2018 gdpr and innovation - peter wells - open data institute
Retail week live 2018   gdpr and innovation - peter wells - open data instituteRetail week live 2018   gdpr and innovation - peter wells - open data institute
Retail week live 2018 gdpr and innovation - peter wells - open data institute
 
data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017
data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017
data and policy - presentation at Data gedreven Beleidsontwikkeling Nov 2017
 
open data and advocacy - eu datahon november 2017
open data and advocacy - eu datahon november 2017open data and advocacy - eu datahon november 2017
open data and advocacy - eu datahon november 2017
 
Travelspirit 2017 the opportunity of open - peter w presentation (1)
Travelspirit 2017   the opportunity of open - peter w presentation (1)Travelspirit 2017   the opportunity of open - peter w presentation (1)
Travelspirit 2017 the opportunity of open - peter w presentation (1)
 
2017-09-07 using data to create impact- some policy design patterns - data fo...
2017-09-07 using data to create impact- some policy design patterns - data fo...2017-09-07 using data to create impact- some policy design patterns - data fo...
2017-09-07 using data to create impact- some policy design patterns - data fo...
 
Mydata2017 case studies - open banking - peter wells - open data institute
Mydata2017   case studies - open banking - peter wells - open data instituteMydata2017   case studies - open banking - peter wells - open data institute
Mydata2017 case studies - open banking - peter wells - open data institute
 
Mydata2017 ourdata track - mydata ourdata symbiotic relationship - peter we...
Mydata2017   ourdata track - mydata ourdata symbiotic relationship - peter we...Mydata2017   ourdata track - mydata ourdata symbiotic relationship - peter we...
Mydata2017 ourdata track - mydata ourdata symbiotic relationship - peter we...
 
2017 presentation - peter wells - public sector cloud - data infrastructure
2017   presentation - peter wells - public sector cloud - data infrastructure2017   presentation - peter wells - public sector cloud - data infrastructure
2017 presentation - peter wells - public sector cloud - data infrastructure
 
Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017
 
Open your effing data presentation 2017
Open your effing data presentation 2017Open your effing data presentation 2017
Open your effing data presentation 2017
 
Odi fridays 201611 gov cats
Odi fridays 201611 gov catsOdi fridays 201611 gov cats
Odi fridays 201611 gov cats
 

Recently uploaded

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 

Recently uploaded (20)

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 

Open tech digital cholera

  • 1. Digital Cholera Peter Wells - @peterkwells OpenTech - June 2015
  • 2. Cholera The last time I was in this building I went to a talk on an early example of data analysis and data visualisation. John Snow famously traced a fatal cholera epidemic in Soho in 1854 to a local water pump. Because of cholera in the pump the water was not safe to use. Read more about John Snow: http://en.wikipedia.org/wiki/John_Snow_%28physician%29 @peterkwells
  • 3. Cholera and infrastructure The Soho outbreak started at a water pump, it could have been a water reservoir. The cholera bacteria would spread and contaminate the water downstream. An entire set of water infrastructure could have been contaminated. The water would not have been safe to use. Yet water is essential to life. Image CC-BY-2.0 by Woodley Wonderworks: https://www.flickr.com/photos/wwworks/ @peterkwells
  • 4. Safe water As a society we invest in water infrastructure. We have: - inspections - alerting systems - purification - education We put more focus at the top of the infrastructure, on water producers and distributors, than we do on water users. The goal is to make water that’s safe for people to use. A Doctor from the World Health Organisation @peterkwells
  • 5. Get to the digital... @peterkwells
  • 6. Open Addresses Organisations have to buy lists of UK addresses, licensing is complicated, the quality isn’t great, the data doesn’t meet all the needs. It’s hard to build new services. Open Addresses explored whether it was possible to build a new UK address list, to make things simpler and make addresses more widely used. @peterkwells
  • 7. Addressing needs Denmark had a 1000% increase in the organisations that use address data by making address data simpler to use. We discovered other needs and benefits: - people who move into new houses need their addresses to be published faster - people name their houses and need other people to know about it - people need it to be easier to enter addresses on websites - (I could go on…) @peterkwells More and better services that would make life a little bit easier
  • 8. Getting addresses As well as understanding the needs we had to find data. There are 26-40m addresses in the UK. The Land Registry publishes over 18 million addresses in the Price Paid Dataset. Sounds great! @peterkwells Aside: we also did some neat stuff on mathematical inference for addresses. Check out www.openaddressesuk.org...
  • 9. Land Registry says no... Image from Owen Boswarva: http://mapgubbins.tumblr.com/post/107499166390/it-was-all-a-dream-land-registrys-price-paid @peterkwells
  • 10. Third Party Rights are complex and can be fatal Address datasets can include third-party database rights: 1. if the data was directly copied from an existing address database 2. if an existing list of addresses (obtained through another route) was corrected or validated based on an existing address database Unauthorised use of third party rights creates risk for both data publishers and consumers. The service can simply…... stop. @peterkwells
  • 11. Third party rights, they’re everywhere! As we inspected other datasets we saw similar issues with unauthorised rights: - websites for data capture that used third party address products - datasets that had been cleansed with third party address products - a clean website followed by automated back-end validation Even with submission guidelines, provenance tracking and takedown policies the legal position for Open Addresses was really complex. We made a :( @peterkwells
  • 12. Lightbulb It is complicated to determine if unauthorised third party rights exist. You need to inspect the data and how it was produced @peterkwells Image by Richard Rutter: https://www. flickr.com/photos/clagnut/
  • 13. Safe water - a reprise As a society we invest in water infrastructure: - inspections - alerting systems - purification - education We put more focus at the top of the infrastructure, on water producers and distributors, than we do on water users. The goal is to make water that’s safe for people to use. Image CC-BY-2.0 by Woodley Wonderworks: https://www.flickr.com/photos/wwworks/ @peterkwells A Doctor from the World Health Organisation
  • 14. Digital cholera @peterkwells Copyright is a good thing (don’t believe me? ask a musician) so I’m using a harsh metaphor, but the metaphor is useful. Don’t take away my copyright!
  • 15. Digital cholera @peterkwells The water may be infected with cholera. Therefore we inspect it to see if the water is safe to use. Land Registry address data may be infected with digital cholera. Therefore we inspect it to see if the data is safe to use. We learnt it wasn’t so we didn’t….
  • 16. Digital cholera @peterkwells Not just about unauthorised third party rights. Inappropriate releases of personal data. Incomplete data. Incorrect data. Remember it’s a metaphor.
  • 17. Digital cholera @peterkwells Can we learn more from how society learnt to deal with cholera in water?
  • 18. Alerting system? @peterkwells We’ve told Land Registry of the problem(s). We’ve published articles to alert others. We’re here. Should this be better?
  • 19. Purification? @peterkwells Tricky. There is no equivalent of a purification tablet. We need to cleanse data infrastructure of digital cholera or we need to rebuild it. It is simplest if the data is kept pure by whoever creates and maintains it. Just as with water.
  • 20. Education @peterkwells The ODI already have a wealth of education material and are including the thinking and learning from Open Addresses in some future work: Send your ideas more here:http://theodi.org/who-owns-our-data-infrastructure?
  • 21. Water is essential to life so we invest in maintaining our water infrastructure to make water safe to use. Data gives us more and better services. It is is essential to life. We need to invest in maintaining useful data infrastructure to make data safe to use. @peterkwells
  • 22. @peterkwellsImage by Don Graham: https://www.flickr.com/photos/23155134@N06/ If we don’t look after our data infrastructure we risk simply ending up with some rusty and unused data pumps….