Address Day
what next after the Address Wars
Jeni Tennison - @JeniT
5 March 2015
https://openaddressesuk.org
@openaddressesuk
In economics, a public good is a good that is
both non-excludable and non-rivalrous in
that individuals cannot be effectively
excluded from use and where use by one
individual does not reduce availability to
others.
Wikipedia - Public good
"Tompkins Square Park Central Knoll" by David Shankbone - (CC BY-SA 3.0) via Wikimedia Commons
open
data
public
good
sum of what
everyone
would pay
what it costs
to maintain
When should a good be public?
Address data should be open data
● National Information Infrastructure
● Not just for posting mail...
○ geocoding for route finding
○ associating people with areas
○ classification for targeting interventions
○ linking datasets together
● Denmark has taken this step
○ 1000% increase in use of address data
○ costs = €0.2M - benefits = €14M
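As a quick check on the Danish figures quoted above, the arithmetic can be written out. This is only a sketch on the two numbers as they appear on the slide; the slide does not say what period they cover, and the variable names are illustrative.

```python
# Figures as quoted on the slide, in millions of euros.
costs_million_eur = 0.2
benefits_million_eur = 14.0

# Benefit returned per euro of cost, and the difference between the two.
benefit_cost_ratio = benefits_million_eur / costs_million_eur
net_benefit_million_eur = benefits_million_eur - costs_million_eur

print(f"benefit/cost ratio: {benefit_cost_ratio:.0f}:1")
print(f"net benefit: EUR {net_benefit_million_eur:.1f}M")
```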
Current real life problems
● startup wanting to build an application
○ prohibitive costs
○ prohibitive licensing complexity
● SME with a geodemographic product
○ prohibitive costs
○ limiting customer base & growth
● New build owners
○ 3 months to register to vote, order pizza
Funding public goods
● Government via taxation
● Collaborative bound by contract
● Cross-subsidy by selling other goods
● Voluntary effort
● Social norms
"The sale of the PAF with the Royal Mail was a mistake.
Public access to public sector data must never be sold or
given away again. This type of information, like census
information and many other data sets, is very expensive
to collect and collate into useable form, but it also has
huge potential value to the economy and society as a
whole if it is kept as an open, public good."
Bernard Jenkin, Chair of Public Administration Select Committee
Hypothesis 1: the maintenance of open address
data can only be effectively funded through
taxation
Hypothesis 2: it is possible to build and maintain
a sustainable open address database using
collaboration, cross-subsidy and voluntary effort
Goals
● Free, openly licensed, up-to-date bulk
downloads of addresses
● Freemium services over that data
○ e.g. validation, auto-completion, geocoding
● 100% open source, collaboratively
maintained
● Initial ~£400k investment from government
○ compared with £25M annual cost maintaining PAF
Eventual Architecture
“Definitive” UK address list
- where the address data is safe to use
- where each record has confidence and provenance
Bulk
- Download
- Upload
APIs
- Add
- Sort
- Validate
- Search
URLs
- Linked data
- Extensibility
Service Providers
Aggregators, digital, telecoms, public sector, distribution, academics, manufacturers etc
Services
- Websites,
Users
Value
Revenue for sustainability
This takes time
Large datasets and inference to tackle the bulk of the challenge
“80/20” rule
Ongoing, collaborative maintenance
Targeted work. Low-volume records to fill existing gaps in available datasets
NB: dates are “just for fun”
Approaches
1. Load open datasets containing addresses
2. Build out crowdsourcing mechanisms
3. Use inference to fill gaps
and throughout:
● keep track of provenance
● keep track of confidence
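One way to picture "keep track of provenance and confidence" is a record type that carries both alongside the address. The sketch below is hypothetical: the class names, field names, the 0.0 to 1.0 confidence scale and the rule of keeping the highest confidence seen are all assumptions for illustration, not the Open Addresses data model.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class Provenance:
    source: str        # hypothetical label for the dataset or contributor
    method: str        # "dataset", "crowdsourced" or "inferred" (the three approaches)
    recorded_on: date


@dataclass
class AddressRecord:
    street: str
    town: str
    postcode: str
    confidence: float  # assumed scale: 0.0 (no confidence) to 1.0 (certain)
    provenance: List[Provenance] = field(default_factory=list)

    def add_evidence(self, item: Provenance, confidence: float) -> None:
        """Attach another provenance entry and keep the highest confidence seen."""
        self.provenance.append(item)
        self.confidence = max(self.confidence, confidence)


record = AddressRecord("St James Square", "Cheltenham", "GL50 3PR", confidence=0.5)
record.add_evidence(Provenance("Companies House", "dataset", date(2015, 3, 5)), 0.8)
print(record.confidence, [p.method for p in record.provenance])
```

Keeping provenance as a list means one address can be backed by several sources at once, which is what makes later confidence calculations possible.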
Loading datasets
Third Party IPR
Possibly infected if validated
against PAF or AddressBase
⇒ most Government “open”
data is infected
A few not:
● Companies House
● err...
Platform for loading bulk data
Originally developed for OpenCorporates
Sandboxed environment for running scripts
Motivating crowdsourcing
Bulk
- Download
- Upload
APIs
- Add
- Sort
- Validate
- Search
URLs
- Linked data
- Extensibility
Value
Building Blocks
- towns, postcodes, streets
- used to parse data and provide
confidence in the address list
- links between towns, postcodes
and streets are learned from
addresses
Authoritative and definitive UK
address list
- where the address data is safe to
use
- where each record has
confidence and provenance
Revenue for sustainability
● Turn free-text
addresses into
building blocks
● Can be used with data
containing third party
IPR
● Optional “contribute”
option
Address parsing service
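To make "turn free-text addresses into building blocks" concrete, here is a minimal sketch of such a parser. It is not the Open Addresses parsing service: the function name, the output keys, the simplified postcode pattern and the assumption that town and street are the last two comma-separated parts are all illustrative.

```python
import re
from typing import Dict, Optional

# Simplified UK postcode pattern (an assumption for illustration only;
# it does not cover every special case).
POSTCODE_RE = re.compile(r"\b([A-Z]{1,2}\d[A-Z\d]?)\s*(\d[A-Z]{2})\b", re.IGNORECASE)


def parse_address(text: str) -> Dict[str, Optional[str]]:
    """Split a comma-separated free-text address into building blocks."""
    postcode = None
    match = POSTCODE_RE.search(text)
    if match:
        postcode = f"{match.group(1).upper()} {match.group(2).upper()}"
        # Remove the postcode so only the textual parts remain.
        text = text[:match.start()] + text[match.end():]
    parts = [p.strip() for p in text.split(",") if p.strip()]
    # Assumed layout: "<premises>, <street>, <town>".
    town = parts.pop() if parts else None
    street = parts.pop() if parts else None
    premises = ", ".join(parts) if parts else None
    return {"premises": premises, "street": street, "town": town, "postcode": postcode}


parsed = parse_address("St James House, St James Square, Cheltenham, GL50 3PR")
print(parsed)
```

A real service would need gazetteers of known towns, streets and postcodes rather than positional guesses, which is the role the building blocks play in the slides.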
Inference
Fogralea
ZE1 0SE
© Open Addresses Ltd.
7 9 11 13 15 17 19 21 23 25 27 29
6 8 10 12 14 16 18 20 22 24 26 28
Fogralea
ZE1 0SE
What about
nos. 1 to 4?
Same
postcode? We
cannot know!
Fogralea
ZE1 0SE
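The inference idea in these slides can be sketched as gap filling: if some house numbers on a street are known to share a postcode, numbers lying between them on the same side are probably valid too, while numbers outside the observed range (nos. 1 to 4 here) stay unknown. The function below is a minimal sketch under those assumptions; the known numbers in the example are hypothetical, and treating odd and even numbers as separate sides of the street is an assumption suggested by the slide layout.

```python
from typing import Iterable, List


def infer_missing_numbers(known: Iterable[int]) -> List[int]:
    """Return house numbers lying between known ones of the same parity.

    Nothing outside the observed range is returned: we cannot know
    whether those addresses exist or share the postcode.
    """
    known_set = set(known)
    inferred = []
    for parity in (0, 1):  # even side, then odd side
        side = sorted(n for n in known_set if n % 2 == parity)
        if len(side) < 2:
            continue  # a single known number gives no range to fill
        for n in range(side[0], side[-1] + 1, 2):
            if n not in known_set:
                inferred.append(n)
    return sorted(inferred)


# Hypothetical set of numbers already known for Fogralea, ZE1 0SE.
known_numbers = [6, 10, 16, 28, 7, 13, 29]
inferred_numbers = infer_missing_numbers(known_numbers)
print(inferred_numbers)
```

Inferred addresses would naturally carry lower confidence than observed ones, and their provenance would record that they came from inference.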
Enabling collaborative maintenance
St James House, St James Square, Cheltenham, GL50 3PR
7, St James Square, Cheltenham, GL50 3PT
St James North 1, St James Square, Cheltenham, GL50 3PR
St James North 3, St James Square, Cheltenham, GL50 3PR
3, St James Square, Cheltenham, GL50 3PR
St James House, St James Square, Cheltenham Spa, GL50 3PR
St James North 1, St James Square, Cheltenham, GL50 3PR
St James Place, Jessop Avenue, Cheltenham, GL50 3PR
St James House, St James Square, Cheltenham, GL50 3PR
Apt. 3, St James Place, Jessop Avenue, Cheltenham, GL50 3PR
56, Cheltenham Road, London, SE15 3AR
Calculating confidence
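The addresses above can be tallied by postcode sector and town, which gives the kind of Count and Total figures shown in the table that follows. This is a sketch only: it assumes the town is the second-to-last comma-separated part, that the postcode is the last part, and that repeated addresses count as separate observations.

```python
from collections import Counter
from typing import Dict, List, Tuple

ADDRESSES = [
    "St James House, St James Square, Cheltenham, GL50 3PR",
    "7, St James Square, Cheltenham, GL50 3PT",
    "St James North 1, St James Square, Cheltenham, GL50 3PR",
    "St James North 3, St James Square, Cheltenham, GL50 3PR",
    "3, St James Square, Cheltenham, GL50 3PR",
    "St James House, St James Square, Cheltenham Spa, GL50 3PR",
    "St James North 1, St James Square, Cheltenham, GL50 3PR",
    "St James Place, Jessop Avenue, Cheltenham, GL50 3PR",
    "St James House, St James Square, Cheltenham, GL50 3PR",
    "Apt. 3, St James Place, Jessop Avenue, Cheltenham, GL50 3PR",
    "56, Cheltenham Road, London, SE15 3AR",
]


def sector_town_counts(addresses: List[str]) -> Dict[Tuple[str, str], Tuple[int, int]]:
    """Map (sector, town) to (count for that pair, total for the sector)."""
    pair_counts: Counter = Counter()
    sector_totals: Counter = Counter()
    for address in addresses:
        parts = [p.strip() for p in address.split(",")]
        town, postcode = parts[-2].upper(), parts[-1].upper()
        sector = postcode[:-2]  # "GL50 3PR" -> "GL50 3"
        pair_counts[(sector, town)] += 1
        sector_totals[sector] += 1
    return {pair: (n, sector_totals[pair[0]]) for pair, n in pair_counts.items()}


counts = sector_town_counts(ADDRESSES)
for (sector, town), (count, total) in sorted(counts.items()):
    print(sector, town, count, total)
```

Here "Cheltenham Spa" shows up as a minority town name within the sector, which is the kind of disagreement the confidence figures are meant to surface.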
Sector   Town             Count   Total   Confidence
...
HD3 4    HUDDERSFIELD        66      66       87.71%
...
DG8 6    NEWTON STEWART      11      12       65.69%
DG8 6    STRANRAER            1      12        0.00%
DG8 7    NEWTON STEWART       1       1        0.00%
...
W3 6     LONDON             196     196       92.96%
...
CH44 4   WALLASEY            23      29       76.06%
CH44 4   WIRRAL               6      29        8.22%
Calculating confidence
This postcode/town association is right but
confidence is low because of the low count
This postcode/town association is incorrect
Another correct postcode/town association,
but with a higher count
This is what happens when post towns are
re-organised; Wirral is now split into
Birkenhead, Wallasey, Wirral and Prenton
This is what a correct postcode/town
association looks like
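The slides do not give the formula behind the Confidence column, so the sketch below swaps in a different, well-known technique: the lower bound of the Wilson score interval. It is shown only because it has the two properties the annotations describe (an association backed by a single record scores low even when it is right, and a minority association scores low). It does not reproduce the percentages in the table, and the function name and the choice of z = 1.96 are assumptions.

```python
import math


def wilson_lower_bound(count: int, total: int, z: float = 1.96) -> float:
    """Lower bound of the Wilson score interval for count successes out of total."""
    if total == 0:
        return 0.0
    p = count / total
    denom = 1 + z * z / total
    centre = p + z * z / (2 * total)
    margin = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total))
    return max(0.0, (centre - margin) / denom)


# (sector, town, count, total) rows taken from the table above.
ROWS = [
    ("HD3 4", "HUDDERSFIELD", 66, 66),
    ("DG8 6", "NEWTON STEWART", 11, 12),
    ("DG8 6", "STRANRAER", 1, 12),
    ("DG8 7", "NEWTON STEWART", 1, 1),
    ("W3 6", "LONDON", 196, 196),
    ("CH44 4", "WALLASEY", 23, 29),
    ("CH44 4", "WIRRAL", 6, 29),
]

for sector, town, count, total in ROWS:
    print(f"{sector:7} {town:15} {wilson_lower_bound(count, total):.2%}")
```

Any scoring rule of this kind rewards both agreement within a sector and the number of supporting records, which is why loading more addresses raises confidence across the list.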
Provenance
Summary
● Built most of the supporting platform
○ parsing free text / messy addresses
○ collaborative loading of data
○ providing downloads, search & URL identity
○ recording provenance & assigning confidence
○ using inference to fill in gaps
● We have low numbers of addresses currently
○ but the right mechanisms to add more
○ and many potential partners
What next?
● Building the platform
● Building the community of collaborators
● Building services to aid cross-subsidy
● Increasing quantity & quality of addresses
● Can anyone else reuse the technology?
● Can anyone else reuse the approach?
Any Questions?
@JeniT - jeni.tennison@openaddressesuk.org
https://openaddressesuk.org
info@openaddressesuk.org
@openaddressesuk
Open Addresses Ltd. is a new company being set
up to create and maintain an address database
for the UK that will be made available to the
public as Open Data. It will facilitate the
collaborative maintenance of the address
database with various stakeholders from the UK
Government, industry and non-profit.
Offices
Where?

BCS Address Day - Open Addresses
