SlideShare a Scribd company logo
1 of 52
Addresses Vs Households - RSS July ‘17
Building an address index for
census and beyond
Alistair Calder
Head of Addressing
Data Architecture - ONS
Mike James
Head of Address Research
Data Architecture - ONS
Addresses
• ONS Requirements – and why it has now become easy
• Issues – and why it is still really hard
• Addressing in Government - joining up
• Addresses and Admin data – building quality
• Demo
(& an annexe)
CENSUS OPERATION 2021
CCS
Enforce
-ment
Build
register
ADDRESS
LIST
Post
Out
IAC
Hand
Deliver
HOUSE
HOLDS
COMMUNAL
ESTABLISHMENTS
100K ?
29M ?
ONLINE
Completion
Paper
??
Track
response
Follow
Up
Reminder
Letters
Emails
RESPONSE
DATABASE
Estimation
& outputs
Admin
data
OUTPUT
DATABASE
ESTIMATION
CENSUS OPERATION 2021
Post
Out
IAC
Hand
Deliver
ONLINE
Completion
Paper
??
CCS
Enforce
-ment
Track
response
Build
register
ADDRESS
LIST
Follow
Up
HOUSE
HOLDS
COMMUNAL
ESTABLISHMENTS
100K ?
29M ?
Estimation
& outputs
Admin
data
OUTPUT
DATABASE
Reminder
Letters
emails
RESPONSE
DATABASE
ESTIMATION
and …
• Social Survey – survey frames
• Online Survey transformation - eQ
• Data collection eg Life Events – locating events
• Business & economic statistics – locating & linking
• Admin data – spine for linking & integrating
• Other government departments
• etc
The requirement (tbc)
• A ‘complete’ household frame
>99% of household spaces ( addresses)
• Minimal over-coverage
duplicates / commercial / demolished etc < 2 or 3% ?
• A brilliant (integrated) communal frame
• Residential, communal & business (& non postal)
• Up to date, correctly located etc etc …. And more
Postcode
Address
File
PAF
Valuation
Office
National Land
& Property
Gazetteer
NLPG
Address
Layer 2
AL2
LLPGs
LLPGs
x 348
Council
Tax
TV
License
Utilities
Emerg.
Services
etc
Addressing for Census
The olden days
Addressing – Census 2011
PAF
AL2
VOA
Residential address list
RULES CLERICAL
Communal address list
NLPG
Very good Just good enough
Why it’s going
to be easy
this time ….
PAF
AL2
VOA
Residential address list
RULES
NLPG
Residential address list
PAF
AL2
VOA
RULES
NLPG
Communal address list
Additional CE sources
2011 Census AddressBase
Why it’s still
really hard….
The challenge ..... Why it’s hard this time
• We have an excellent starting point but addresses are
complicated and change a lot. There will be error & error
clusters itself in the areas we care about the most – Very
difficult to check quality
• Extracting the right ones is difficult. Small errors can be
significant – and cause trauma
• Communals are important and particularly challenging
• We plan to do MUCH more with addresses than post-out – huge
opportunity but attribute thinking is new
• Addresses are complex so matching is really hard
The Emerging Strategy
what’s the plan?
Flat 1 Flat 2
Flat 3 Flat 4
Flat 5 Flat 6
Flat 7
7
The Emerging Strategy
what’s the plan?
42 5 5? B
10
The challenge ..... Why it’s hard this time
• We have an excellent starting point but addresses are
complicated and change a lot. There will be error & error
clusters itself in the areas we care about the most – Very
difficult to check quality
• Extracting the right ones is difficult. Small errors can be
significant – and cause trauma
• Communals are important and particularly challenging
• We plan to do MUCH more with the register than post-out –
huge opportunity but attribute thinking is new
• Addresses are complex so matching is really hard
The challenge ..... Why it’s hard this time
• We have an excellent starting point but addresses are
complicated and change a lot. There will be error & error
clusters itself in the areas we care about the most – Very
difficult to check quality
• Extracting the right ones is difficult. Small errors can be
significant – and cause trauma
• Communals are important and particularly challenging
• We plan to do MUCH more with addresses than post-out – huge
opportunity but attribute thinking is new
• Addresses are complex so matching is really hard
Lists of
communals
Compared to
counts from
admin data
Linked to
Business
Index
Linked to
Address
Index
The challenge ..... Why it’s hard this time
• We have an excellent starting point but addresses are
complicated and change a lot. There will be error & error
clusters itself in the areas we care about the most – Very
difficult to check quality
• Extracting the right ones is difficult. Small errors can be
significant – and cause trauma
• Communals are important and particularly challenging
• We plan to do MUCH more with addresses than post-out – huge
opportunity but attribute thinking is new
• Addresses are complex so matching is really hard
A probabilistic address frame
Probability of
• Existence of address
• type - HH/B/CE
• HH Size / structure
• Change / churn
• Hard to countness / category
• (multivariate >> categorisation
• Eg possible holiday home, carehome, student
accommodation
Address
Register
HH
Structure
2011
Census
HH structure,
churn, names
Activity data
Energy, utilities,
broadband, health,
house sales
Admin data
HH structure, churn,
names, house
prices, phone
numbers
Other
Shape / pattern
recognition
Survey paradata
Geoplace
And other CE sources
CE
New definition / schema
Inform field planning / targetting
Intelligent stratification
Prioritise follow up (address level)
Inform estimation & modelling
B
Business Reg
Business structure,
type, churn
Conceptually – all subject to ethical and privacy discussion !
Potentially
The challenge ..... Why it’s hard this time
• We have an excellent starting point but addresses are
complicated and change a lot. There will be error & error
clusters itself in the areas we care about the most – Very
difficult to check quality
• Extracting the right ones is difficult. Small errors can be
significant – and cause trauma
• Communals are important and particularly challenging
• We plan to do MUCH more with addresses than post-out – huge
opportunity but attribute thinking is new
• Addresses are complex so matching is really hard
•
The bigger picture
– addresses as a linking
mechanism across government
Government Digital Service (GDS)
Vision for Registers
ONS or
citizen
servicesingle
address UPRN
10 High St PO15 5RR 1234567891011
batch of
addresses
addresses
UPRNs
batch
match
Addressbase load
UPRNs
addresses
classifications
Feedback
to source
(improving quality)
api
api
ONS Data Library
Address
Index
Business
Index
Address Matching - Beta
correct match rate
virtually zero false positives
balance between automatic & clerical
flexibility of match tuned but not limited
fast
scalable
accessible via api
non proprietary code -> open
Searching and matching – what we want
Avenue Cars Limted
1st Floor
St. William of York House
22-24 First Road,
Street, Somerset
ZE1ODW
synonyms
thesaurus
aliases
lookups
Parsing
Rules based +
Machine learning /
Natural language
Source
input address
address
components
how we are going to do it
Informed
decision –
clerical
intervention
HOPPER SCORE
Confidence rank
of options
ES
Fuzzy matching
Distance measures
synonyms
thesaurus
aliases
lookups
Parsing
Rules based +
Machine learning /
Natural language
Source
input address
address
components
AddressBase
hierarchies
ESindexes
Hopper score
User testing
Alpha – Address
Index build
2015 2016 2017 2018 2019 2020 2021 2022 2023April July October April July October
On-line Survey
transformation
Admin
Data
Admin Data –
Processing
Platform
Alpha
EDC – eQ Alpha EDC – eQ Beta
EDC – Response and Respondent Management Beta
Admin Data – Processing Platform Beta
EDC – Service enhancement
Admin/Survey
Integration
Discovery
Admin/Survey
Integration –
Alpha
Admin/Survey Integration – Beta
Alpha - Business
Index build
Beta - Business index
build
Beta - Address index
build
Registers
2019
Census
Rehearsal
Admin
Data for
Census
Census
Register / Index Platform for ONS
Live services
Decision to
proceed to
beta Develop data migration and data loader for new
BIS data source
IDBR Service Migration
IDBR Migration
Roadmap
Business Statistics
Decision(s) to go
live
2021
Census
Life Events, Social Survey etc etc
The Address Register in an Admin Data Census
• What is the role of the Address Register
• Address Register Quality
• Address Matching Demo (what could possibly go
wrong…?)
A perfect address register won't overcome all the
issues of moving from HH to address definition
But it sure would be helpful…
The Address Register in an Admin Data Census
People on Admin Data
Address Register
Address Register Quality
People on Admin Data
Address Register
Over Coverage
Address Register Quality
People on Admin Data
Address Register
Under Coverage
Over Coverage
Matching Addresses to the Address Register
People on Admin Data
Address Register
Under Coverage
Over CoverageAddress Matching
Citizen Address Search
Citizens Identifying Their Address in Admin Data
Address Register
Under Coverage
Over Coverage
I Live There
Strategy for Delivering Quality
• Using AddressBase Premium (ABP)
– 2.2M more residential addresses than PAF
• Close partnership with Geoplace
• Lots of LA engagement
• Supporting the use of ABP in Government – embed Unique Property
Reference Number (UPRN) throughout government data
• Understanding types of error, their causes and
impacts
– Over coverage (duplication, misclassifications)
• Non-existent annexes (included by some idiot…..)
– Under coverage (missed HMO, missing new builds,
misclassifications)
– Single instance or clustered?
Methods & Evidence of Quality
• Over-Coverage
– Social survey outcomes (does the sample include non-
residential addresses?)
• Using ABP to clean PAF removes majority of non-residential addresses
– Analysing Census tests
• Number and cause on non-deliveries (non-residential, not yet built)
– Within 1% error target
– Can improve through GeoPlace/LA collaborative working
• Under-Coverage
– Admin data – are there addresses we can’t find on ABP?
• Sample of 100K – only 2 addresses we can’t find
– Social surveys – are there addresses we might misclassify as
non-residential?
• Sample of 120K – only 135 address we might misclassify (and these
are uncertain)
Communal Establishments
• Really important to Census
– Care homes, university halls, sheltered housing, etc
– Enumeration challenges
– Impact on statistics
• Really important to Admin Data Census!
– Working with ADC to understand their requirements
• Our approach:
– Create CE QA Pack for each CE type
• Definitions, data sources, risks, mitigations, LA risk analysis
– Provide a framework for identifying, monitoring and
improving CE data
Address Matching Demo….
Summary
• AddressBase at the core – need to confirm & ensure quality
• Linked and integrated indexes
• addresses, communals, businesses, attribution
• No separate national address register (except temp / operational)
• it is all about improving the national source
• Increased use of source >>> linking >>> feedback to improve the national hub
• Local Authority liaison critical to the plan
• Share approach and lists much earlier than before
• – but coding of AddressBase & LLPGs the key
• ONS highly supportive of openness / open data
– but not dependant upon it
• Matching Service Talking to GDS, OS, HMRC, BEIS, DWP , Wales … etc.
• Love to share and talk about addresses and matching
addresses@ons.gov.uk
Questions?
(& come and talk to us)
alistair.calder@ons.gov.uk; @alistaircalder_
michael.james@ons.gov.uk
addresses@ons.gov.uk

More Related Content

Similar to Ons households july 17 addressing ac mj

Building an address register for the 2021 Census and beyond
Building an address register for the 2021 Census and beyondBuilding an address register for the 2021 Census and beyond
Building an address register for the 2021 Census and beyondOffice for National Statistics
 
Addresses vs Households : Who needs Household statistics?
Addresses vs Households : Who needs Household statistics?Addresses vs Households : Who needs Household statistics?
Addresses vs Households : Who needs Household statistics?Office for National Statistics
 
Address Frame Requirements: Alistair Calder , Office for National Statistics
Address Frame Requirements: Alistair Calder , Office for National StatisticsAddress Frame Requirements: Alistair Calder , Office for National Statistics
Address Frame Requirements: Alistair Calder , Office for National StatisticstheODI
 
1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...
1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...
1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...Vivastream
 
Geocoding Best Practices: Taking an Address-Centric Approach
Geocoding Best Practices: Taking an Address-Centric ApproachGeocoding Best Practices: Taking an Address-Centric Approach
Geocoding Best Practices: Taking an Address-Centric ApproachPrecisely
 
Lowry colorado state address dataset data quality
Lowry colorado state address dataset data qualityLowry colorado state address dataset data quality
Lowry colorado state address dataset data qualityGeCo in the Rockies
 
Audiologists email &amp; mailing list
Audiologists email &amp; mailing listAudiologists email &amp; mailing list
Audiologists email &amp; mailing listRobertAutin
 
Adopting addressbase-premium in the energy sector
Adopting addressbase-premium in the energy sectorAdopting addressbase-premium in the energy sector
Adopting addressbase-premium in the energy sectorJonathan Clark
 
Anesthetists email list
Anesthetists email listAnesthetists email list
Anesthetists email listRobertAutin
 
How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)
How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)
How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)Jennifer Soares
 
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...An Agile & Adaptive Approach to Addressing Financial Services Regulations and...
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...Neo4j
 
From Call Center to Field: Improving the Customer Experience with Location In...
From Call Center to Field: Improving the Customer Experience with Location In...From Call Center to Field: Improving the Customer Experience with Location In...
From Call Center to Field: Improving the Customer Experience with Location In...Precisely
 
DM101: Data Hygiene/Data Processing (MMI Direct)
DM101: Data Hygiene/Data Processing (MMI Direct)DM101: Data Hygiene/Data Processing (MMI Direct)
DM101: Data Hygiene/Data Processing (MMI Direct)Avalon Consulting
 
Athletic therapists email list
Athletic therapists email listAthletic therapists email list
Athletic therapists email listRobertAutin
 
DM Crash Course
DM Crash CourseDM Crash Course
DM Crash CourseVivastream
 
Dentists email &amp; mailing list
Dentists email &amp; mailing listDentists email &amp; mailing list
Dentists email &amp; mailing listRobertAutin
 
Chiropractors email list
Chiropractors email listChiropractors email list
Chiropractors email listRobertAutin
 
Furniture and fixtures executives email list
Furniture and fixtures executives email listFurniture and fixtures executives email list
Furniture and fixtures executives email listGlobal B2B Contacts LLC
 
How High Performance B2B Sales Teams Squeeze The Most Out Of Every Lead
How High Performance B2B Sales Teams Squeeze The Most Out Of Every LeadHow High Performance B2B Sales Teams Squeeze The Most Out Of Every Lead
How High Performance B2B Sales Teams Squeeze The Most Out Of Every LeadLeadGenius
 
Architectural services industry email list
Architectural services industry email listArchitectural services industry email list
Architectural services industry email listGlobal B2B Contacts LLC
 

Similar to Ons households july 17 addressing ac mj (20)

Building an address register for the 2021 Census and beyond
Building an address register for the 2021 Census and beyondBuilding an address register for the 2021 Census and beyond
Building an address register for the 2021 Census and beyond
 
Addresses vs Households : Who needs Household statistics?
Addresses vs Households : Who needs Household statistics?Addresses vs Households : Who needs Household statistics?
Addresses vs Households : Who needs Household statistics?
 
Address Frame Requirements: Alistair Calder , Office for National Statistics
Address Frame Requirements: Alistair Calder , Office for National StatisticsAddress Frame Requirements: Alistair Calder , Office for National Statistics
Address Frame Requirements: Alistair Calder , Office for National Statistics
 
1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...
1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...
1. Better Lists: The NEW New Rules of Email: Leading Approaches that Fortify ...
 
Geocoding Best Practices: Taking an Address-Centric Approach
Geocoding Best Practices: Taking an Address-Centric ApproachGeocoding Best Practices: Taking an Address-Centric Approach
Geocoding Best Practices: Taking an Address-Centric Approach
 
Lowry colorado state address dataset data quality
Lowry colorado state address dataset data qualityLowry colorado state address dataset data quality
Lowry colorado state address dataset data quality
 
Audiologists email &amp; mailing list
Audiologists email &amp; mailing listAudiologists email &amp; mailing list
Audiologists email &amp; mailing list
 
Adopting addressbase-premium in the energy sector
Adopting addressbase-premium in the energy sectorAdopting addressbase-premium in the energy sector
Adopting addressbase-premium in the energy sector
 
Anesthetists email list
Anesthetists email listAnesthetists email list
Anesthetists email list
 
How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)
How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)
How Focusing on Email Address Quality Can Save Your Buget (And Your Butt)
 
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...An Agile & Adaptive Approach to Addressing Financial Services Regulations and...
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...
 
From Call Center to Field: Improving the Customer Experience with Location In...
From Call Center to Field: Improving the Customer Experience with Location In...From Call Center to Field: Improving the Customer Experience with Location In...
From Call Center to Field: Improving the Customer Experience with Location In...
 
DM101: Data Hygiene/Data Processing (MMI Direct)
DM101: Data Hygiene/Data Processing (MMI Direct)DM101: Data Hygiene/Data Processing (MMI Direct)
DM101: Data Hygiene/Data Processing (MMI Direct)
 
Athletic therapists email list
Athletic therapists email listAthletic therapists email list
Athletic therapists email list
 
DM Crash Course
DM Crash CourseDM Crash Course
DM Crash Course
 
Dentists email &amp; mailing list
Dentists email &amp; mailing listDentists email &amp; mailing list
Dentists email &amp; mailing list
 
Chiropractors email list
Chiropractors email listChiropractors email list
Chiropractors email list
 
Furniture and fixtures executives email list
Furniture and fixtures executives email listFurniture and fixtures executives email list
Furniture and fixtures executives email list
 
How High Performance B2B Sales Teams Squeeze The Most Out Of Every Lead
How High Performance B2B Sales Teams Squeeze The Most Out Of Every LeadHow High Performance B2B Sales Teams Squeeze The Most Out Of Every Lead
How High Performance B2B Sales Teams Squeeze The Most Out Of Every Lead
 
Architectural services industry email list
Architectural services industry email listArchitectural services industry email list
Architectural services industry email list
 

Recently uploaded

Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 

Recently uploaded (20)

Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 

Ons households july 17 addressing ac mj

  • 1. Addresses Vs Households - RSS July ‘17 Building an address index for census and beyond Alistair Calder Head of Addressing Data Architecture - ONS Mike James Head of Address Research Data Architecture - ONS
  • 2. Addresses • ONS Requirements – and why it has now become easy • Issues – and why it is still really hard • Addressing in Government - joining up • Addresses and Admin data – building quality • Demo (& an annexe)
  • 3. CENSUS OPERATION 2021 CCS Enforce -ment Build register ADDRESS LIST Post Out IAC Hand Deliver HOUSE HOLDS COMMUNAL ESTABLISHMENTS 100K ? 29M ? ONLINE Completion Paper ?? Track response Follow Up Reminder Letters Emails RESPONSE DATABASE Estimation & outputs Admin data OUTPUT DATABASE ESTIMATION
  • 4. CENSUS OPERATION 2021 Post Out IAC Hand Deliver ONLINE Completion Paper ?? CCS Enforce -ment Track response Build register ADDRESS LIST Follow Up HOUSE HOLDS COMMUNAL ESTABLISHMENTS 100K ? 29M ? Estimation & outputs Admin data OUTPUT DATABASE Reminder Letters emails RESPONSE DATABASE ESTIMATION and … • Social Survey – survey frames • Online Survey transformation - eQ • Data collection eg Life Events – locating events • Business & economic statistics – locating & linking • Admin data – spine for linking & integrating • Other government departments • etc
  • 5. The requirement (tbc) • A ‘complete’ household frame >99% of household spaces ( addresses) • Minimal over-coverage duplicates / commercial / demolished etc < 2 or 3% ? • A brilliant (integrated) communal frame • Residential, communal & business (& non postal) • Up to date, correctly located etc etc …. And more
  • 6. Postcode Address File PAF Valuation Office National Land & Property Gazetteer NLPG Address Layer 2 AL2 LLPGs LLPGs x 348 Council Tax TV License Utilities Emerg. Services etc Addressing for Census The olden days
  • 7. Addressing – Census 2011 PAF AL2 VOA Residential address list RULES CLERICAL Communal address list NLPG Very good Just good enough
  • 8. Why it’s going to be easy this time ….
  • 9. PAF AL2 VOA Residential address list RULES NLPG Residential address list PAF AL2 VOA RULES NLPG Communal address list Additional CE sources 2011 Census AddressBase
  • 11. The challenge ..... Why it’s hard this time • We have an excellent starting point but addresses are complicated and change a lot. There will be error & error clusters itself in the areas we care about the most – Very difficult to check quality • Extracting the right ones is difficult. Small errors can be significant – and cause trauma • Communals are important and particularly challenging • We plan to do MUCH more with addresses than post-out – huge opportunity but attribute thinking is new • Addresses are complex so matching is really hard
  • 12.
  • 13.
  • 14. The Emerging Strategy what’s the plan? Flat 1 Flat 2 Flat 3 Flat 4 Flat 5 Flat 6 Flat 7
  • 15. 7
  • 17.
  • 18.
  • 19. 42 5 5? B 10
  • 20. The challenge ..... Why it’s hard this time • We have an excellent starting point but addresses are complicated and change a lot. There will be error & error clusters itself in the areas we care about the most – Very difficult to check quality • Extracting the right ones is difficult. Small errors can be significant – and cause trauma • Communals are important and particularly challenging • We plan to do MUCH more with the register than post-out – huge opportunity but attribute thinking is new • Addresses are complex so matching is really hard
  • 21.
  • 22.
  • 23.
  • 24.
  • 25. The challenge ..... Why it’s hard this time • We have an excellent starting point but addresses are complicated and change a lot. There will be error & error clusters itself in the areas we care about the most – Very difficult to check quality • Extracting the right ones is difficult. Small errors can be significant – and cause trauma • Communals are important and particularly challenging • We plan to do MUCH more with addresses than post-out – huge opportunity but attribute thinking is new • Addresses are complex so matching is really hard
  • 26. Lists of communals Compared to counts from admin data Linked to Business Index Linked to Address Index
  • 27. The challenge ..... Why it’s hard this time • We have an excellent starting point but addresses are complicated and change a lot. There will be error & error clusters itself in the areas we care about the most – Very difficult to check quality • Extracting the right ones is difficult. Small errors can be significant – and cause trauma • Communals are important and particularly challenging • We plan to do MUCH more with addresses than post-out – huge opportunity but attribute thinking is new • Addresses are complex so matching is really hard
  • 28. A probabilistic address frame Probability of • Existence of address • type - HH/B/CE • HH Size / structure • Change / churn • Hard to countness / category • (multivariate >> categorisation • Eg possible holiday home, carehome, student accommodation Address Register HH Structure 2011 Census HH structure, churn, names Activity data Energy, utilities, broadband, health, house sales Admin data HH structure, churn, names, house prices, phone numbers Other Shape / pattern recognition Survey paradata Geoplace And other CE sources CE New definition / schema Inform field planning / targetting Intelligent stratification Prioritise follow up (address level) Inform estimation & modelling B Business Reg Business structure, type, churn Conceptually – all subject to ethical and privacy discussion ! Potentially
  • 29. The challenge ..... Why it’s hard this time • We have an excellent starting point but addresses are complicated and change a lot. There will be error & error clusters itself in the areas we care about the most – Very difficult to check quality • Extracting the right ones is difficult. Small errors can be significant – and cause trauma • Communals are important and particularly challenging • We plan to do MUCH more with addresses than post-out – huge opportunity but attribute thinking is new • Addresses are complex so matching is really hard •
  • 30. The bigger picture – addresses as a linking mechanism across government
  • 31.
  • 32. Government Digital Service (GDS) Vision for Registers
  • 33. ONS or citizen servicesingle address UPRN 10 High St PO15 5RR 1234567891011 batch of addresses addresses UPRNs batch match Addressbase load UPRNs addresses classifications Feedback to source (improving quality) api api ONS Data Library Address Index Business Index Address Matching - Beta
  • 34. correct match rate virtually zero false positives balance between automatic & clerical flexibility of match tuned but not limited fast scalable accessible via api non proprietary code -> open Searching and matching – what we want
  • 35. Avenue Cars Limted 1st Floor St. William of York House 22-24 First Road, Street, Somerset ZE1ODW synonyms thesaurus aliases lookups Parsing Rules based + Machine learning / Natural language Source input address address components how we are going to do it
  • 36. Informed decision – clerical intervention HOPPER SCORE Confidence rank of options ES Fuzzy matching Distance measures synonyms thesaurus aliases lookups Parsing Rules based + Machine learning / Natural language Source input address address components AddressBase hierarchies ESindexes
  • 39. Alpha – Address Index build 2015 2016 2017 2018 2019 2020 2021 2022 2023April July October April July October On-line Survey transformation Admin Data Admin Data – Processing Platform Alpha EDC – eQ Alpha EDC – eQ Beta EDC – Response and Respondent Management Beta Admin Data – Processing Platform Beta EDC – Service enhancement Admin/Survey Integration Discovery Admin/Survey Integration – Alpha Admin/Survey Integration – Beta Alpha - Business Index build Beta - Business index build Beta - Address index build Registers 2019 Census Rehearsal Admin Data for Census Census Register / Index Platform for ONS Live services Decision to proceed to beta Develop data migration and data loader for new BIS data source IDBR Service Migration IDBR Migration Roadmap Business Statistics Decision(s) to go live 2021 Census Life Events, Social Survey etc etc
  • 40. The Address Register in an Admin Data Census • What is the role of the Address Register • Address Register Quality • Address Matching Demo (what could possibly go wrong…?) A perfect address register won't overcome all the issues of moving from HH to address definition But it sure would be helpful…
  • 41. The Address Register in an Admin Data Census People on Admin Data Address Register
  • 42. Address Register Quality People on Admin Data Address Register Over Coverage
  • 43. Address Register Quality People on Admin Data Address Register Under Coverage Over Coverage
  • 44. Matching Addresses to the Address Register People on Admin Data Address Register Under Coverage Over CoverageAddress Matching
  • 45. Citizen Address Search Citizens Identifying Their Address in Admin Data Address Register Under Coverage Over Coverage I Live There
  • 46. Strategy for Delivering Quality • Using AddressBase Premium (ABP) – 2.2M more residential addresses than PAF • Close partnership with Geoplace • Lots of LA engagement • Supporting the use of ABP in Government – embed Unique Property Reference Number (UPRN) throughout government data • Understanding types of error, their causes and impacts – Over coverage (duplication, misclassifications) • Non-existent annexes (included by some idiot…..) – Under coverage (missed HMO, missing new builds, misclassifications) – Single instance or clustered?
  • 47. Methods & Evidence of Quality • Over-Coverage – Social survey outcomes (does the sample include non- residential addresses?) • Using ABP to clean PAF removes majority of non-residential addresses – Analysing Census tests • Number and cause on non-deliveries (non-residential, not yet built) – Within 1% error target – Can improve through GeoPlace/LA collaborative working • Under-Coverage – Admin data – are there addresses we can’t find on ABP? • Sample of 100K – only 2 addresses we can’t find – Social surveys – are there addresses we might misclassify as non-residential? • Sample of 120K – only 135 address we might misclassify (and these are uncertain)
  • 48. Communal Establishments • Really important to Census – Care homes, university halls, sheltered housing, etc – Enumeration challenges – Impact on statistics • Really important to Admin Data Census! – Working with ADC to understand their requirements • Our approach: – Create CE QA Pack for each CE type • Definitions, data sources, risks, mitigations, LA risk analysis – Provide a framework for identifying, monitoring and improving CE data
  • 50. Summary • AddressBase at the core – need to confirm & ensure quality • Linked and integrated indexes • addresses, communals, businesses, attribution • No separate national address register (except temp / operational) • it is all about improving the national source • Increased use of source >>> linking >>> feedback to improve the national hub • Local Authority liaison critical to the plan • Share approach and lists much earlier than before • – but coding of AddressBase & LLPGs the key • ONS highly supportive of openness / open data – but not dependant upon it • Matching Service Talking to GDS, OS, HMRC, BEIS, DWP , Wales … etc. • Love to share and talk about addresses and matching addresses@ons.gov.uk
  • 51.
  • 52. Questions? (& come and talk to us) alistair.calder@ons.gov.uk; @alistaircalder_ michael.james@ons.gov.uk addresses@ons.gov.uk