There are a range of statistical methods Multilevel and hierarchical modelling etc But we’re not using them We’re creating a synthetic ‘Income Census’ We fill each ‘area’ (LSOA)… with ALL households from the relevant region Then give them fractional weights so that key constraint variables in each area match known Census distributions
(and must be in the Census or Census-like data) Relatively few candidates in the Census
There are a range of statistical methods Multilevel and hierarchical modelling etc But we’re not using them We’re creating a synthetic ‘Income Census’ We fill each ‘area’ (LSOA)… with ALL households from the relevant region Then give them fractional weights so that key constraint variables in each area match known Census distributions
OECD scale : 1 (first adult) + 0.5 for extra adults & children except 0.3 for child < 14
No external validation available (yet) for expenditure
Uses Stephen Jenkins’ stata ineqdec0 command to include 0 values
No external validation available (yet) for expenditure
,408 = Ghana, Turkmenistan & United States .268 = Bosnia, Hungary, Finland http://en.wikipedia.org/wiki/List_of_countries_by_income_equality
R sq = 21%
Total minutes per day used as sums total behaviour - e.g. what if per minute charging for service? Also indicator of broadband bandwidth demand.
R sq = only 10.9% Total minutes per day used as sums total behaviour - e.g. what if per minute charging for service? Also indicator of broadband bandwidth demand.
Can’t show demand system modelling results (yet) but here a fictitious example of what we’ve been doing
Validation against the census is excellent!
Total minutes per day used as sums total behaviour - e.g. what if per minute charging for service? Also indicator of broadband bandwidth demand.
Traditional static microsimulation
Would be very interesting to look again at this as broadband changes the way TV is delivered
Demand system: ML method, as gets more complex (e.g. more demographics) gets even slower
Estimating income, expenditure and time-use within small areas - Presentation Transcript
Estimating income, expenditure and time-use within small areas Ben Anderson, Paola De Agostini & Tony Lawson ESRC/BSPS Microsimulation Seminar Series University of Leeds July 2nd 2009
A story in five parts…
What’s the problem?
And what can we do about it?
Income
Small area income inequality
Expenditure:
Small area spending
Time-use:
A spatialised demand system model
Current activities
Census projection
Agent-Based Microsimulation
What are ‘ small areas ’?
Base map
Parliamentary Constituencies
Local/Unitary Authorities
Wards
Lower Layer Super Output Areas (LSOAs)
Output Areas (OAs)
[Postcodes]
Why small area data?
‘ Applied’ reasons
Public & private resource allocation
Public & private service delivery planning
‘ Research’ reasons
‘ small area’ level effects and trends
Deprivation indices and correlates
Environmental/ecological correlates
Local area inequalities and social justice (Runciman)
Intervention analysis
Sampling frameworks
Spatially embedded patterns of consumption
… ?
BUT!
Small area data on people are rare
Census - down to OA level
Administrative - postcode but restricted access
Private/commercial - postcode but restricted access
And the data captured are restricted
Broad but shallow
Rarely much on consumption or time-use
And of sometimes uncertain quality
The requirement…
Synthetic Census(es)
Small area estimates
Sociologically interesting phenomena
Validated where possible
And also
What if? (tomorrow)
What if? (in 5 years time)
What if? (by 2021)
How to achieve it?
Survey data cases
Area 1 in Region X Area 2 in Region X If region = X Census ‘constraint’ tables Weights e.g. Iterative proportional fitting, combinatorial optimization
Identifying constraints
Two criteria:
Exist in Census and Survey
Good predictors of X in survey
We need to know
Which ones make the most difference?
Which ones co-vary?
What is the smallest effective set?
Take microdata (survey)
Stepwise regression of all possible constraints
Which ones are best indicators?
Example: Income deprivation
FRS 2004-6 (pooled)
p(income < 60% UK median)
Net income
Equivalised
Before housing costs
Stepwise regression to select constraints
How to achieve it?
Survey data cases
Area 1 in Region X Area 2 in Region X If region = X Census ‘constraint’ tables Weights e.g. Iterative proportional fitting, combinatorial optimization
The result…
A story in five parts…
What’s the problem?
And what can we do about it?
Income
Small area income inequality
Expenditure:
Small area spending
Time-use:
A spatialised demand system model
Current activities
Agent-Based Microsimulation
Census projection
Income Deprivation
For each (L)SOA:
% households below 60% UK median income
% individuals in households below 60% UK median income
% children …
% working age adults
% pensioners
Why?
Potential use in future IMD (CLG funded)
Analyses of deprivation patterns (CRC, NISRA, WAG funded)
Data
Income: Family Resources Survey (FRS) 2004-6
Space: Census 2001 (LSOAs) and LFS 2004-6 (LA/UAs)
Net equivalised income BHC/AHC
Results: Households
BHC
AHC
Results: Pensioners
BHC
AHC
Validation
Spearman correlation results at English LSOA level for household poverty rates and IMD 2007 income domain score
Outliers where IMD income domain score low but % HHBMI high:
areas with many student HRPs
We have an income distribution…
Local income inequality
So we can calculate a gini coefficient for each zone…
Income inequality within small areas
Local income inequality - results
Mean equivalised household income
Equivalised household income gini
Local income inequality - gini
FRS 2005
East of England, LSOA
LSOA gini:
Mean = .35 (UK = .36 [UN])
Rural zones
Max = Central Cambridge
(gini = .408, % HHBMI = 22%)
Min = South East Ipswich
(gini = .268, %HHBMI = 31.8%)
A story in five parts…
What’s the problem?
And what can we do about it?
Income
Small area income inequality
Expenditure:
Small area spending
Time-use:
A spatialised demand system model
Current activities
Agent-Based Microsimulation
Census projection
Expenditure Modelling
Approach
Census 2001
Expenditure & Food Surveys
Demand system models
Spatial microsimulation
Why?
Resource/consumption ‘poverty’ and inequality modelling
‘ Share of wallet’ & revenue modelling for BT
Water 2005/6
Simulated household weekly water expenditure (EFS 2005/6)
EEDA, LSOA level
Water gini
Water Poverty 2005/6
% Household who spend > 3% of income on water (EFS 2005/6)
AHC = After Housing Costs income
EEDA, LSOA level
Probably needs equivalised income
to highlight low income / high demand
Telephony: 2005/6
Simulated household weekly telephone bill (landlines) (EFS 2005/6)
EEDA, LSOA level
Ward level comparison with BT billing data (EEDA, Ward level)
Spearman rho = 0.7796, p < 0.001
Broadband tax…Noddy’s view
Simple ROI model
£500 cost per household
20% increase in household level spend on internet subscription
50% of additional income to repay investment
No spatial cross-subsidisation
How long to pay back?
A story in five parts…
What’s the problem?
And what can we do about it?
Income
Small area income inequality
Expenditure:
Small area spending
Time-use:
A spatialised demand system model
Current activities
Agent-Based Microsimulation
Census projection
Time in Space: Work
Simulated ‘work time’
ONS Time-Use Survey (2001) and Census 2001
East of England, LSOA
Validation:
(Spearman rho = 0.8404, p < 0.001)
Strong correlation with Census 2001 ‘work time’
Time in Space - Mass media
Simulated total ‘mass media’ time (2001) - TV, radio, DVD, video
Presentation at 'Moving beyond tax-benefit and demo more
Presentation at 'Moving beyond tax-benefit and demographic modelling', University of Leeds on July 2nd 2009. One of a series co-funded by the ESRC and the BSPS on ‘Microsimulation modelling in the UK: bridging the gaps’. less
0 comments
Post a comment