SlideShare a Scribd company logo
1 of 18
Surnames: A Rich Source of Geodemographic Data  James Cheshire, Pablo Mateos Department of Geography, University College London  Research Blog: jamescheshire.co.uk Email: james.cheshire@ucl.ac.uk
Names and Ethnicity - Forenames and surnames can be classified into ethnic groupings. - Already utilized within geodemographics (see Mateos et al., 2007).
In Britain: Cornish names Welsh names
Surnames and Regions ,[object Object]
The highest frequency of these names still exists in their place of origin.
We can therefore expect areas to possess unique combinations of names.
We can also expect certain types of surname to occur more frequently in some areas rather than others.
This surname geography may reflect cultural characteristics and regional identities…,[object Object]
Creating Regions: Aggregating Surname Data ,[object Object]
 The smaller the surname ‘pool’ the greater the probability of isonymy.
Geneticists developed the Coefficient of Isonymy to measure probability of isonymy between two populations.xand y: Districts i: Surname xiand yi: Freq. proportional to the xand y total popn. ,[object Object],Lx,y= -loge2(Rx,y)
Lasker’s Distance Matrices 1881 Matrix 2001 Matrix 		         Yarmouth  Yeovil        York  Aberayron      6.389540 6.289929 6.438361    Aberdeen       6.356152 7.019357 6.213222 Abergavenny  6.412893 6.361753 6.566717 Aberystwith    6.327093 6.319481 6.467985    Abingdon       6.353814 6.559106 6.621873  	       95Z     99ZZ     OOLN   00BL 7.520982 7.336616 7.219516   00BM 7.428889 7.315671 7.425037   00BN 7.347616 7.356772 7.394888   00BP 7.452982 7.299915 7.330886   00BQ 7.410027 7.300150 7.387787
Regional  identity in Britain
Creating Regions: Ward’s Hierarchical Clustering  2001 1881
Creating Regions: K-Means Clustering 2001
Creating Regions: K-Means Clustering 1881
Corby: A Scottish Town? 1881 2001 MDS Ward’s K-Means
Corby: A Scottish Town? In 1932 Stewarts and Lloyds built a new iron and steel works in Corby. Labor sourced from closing Scottish steelworks, mainly in Lanarkshire. Into the 1970s, 50% of the incoming population Scottish. Transformed population from 1,500 to 34,000 . Annual Highland Games.

More Related Content

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

RGS Annual Conference Presentation

  • 1. Surnames: A Rich Source of Geodemographic Data James Cheshire, Pablo Mateos Department of Geography, University College London Research Blog: jamescheshire.co.uk Email: james.cheshire@ucl.ac.uk
  • 2. Names and Ethnicity - Forenames and surnames can be classified into ethnic groupings. - Already utilized within geodemographics (see Mateos et al., 2007).
  • 3. In Britain: Cornish names Welsh names
  • 4.
  • 5. The highest frequency of these names still exists in their place of origin.
  • 6. We can therefore expect areas to possess unique combinations of names.
  • 7. We can also expect certain types of surname to occur more frequently in some areas rather than others.
  • 8.
  • 9.
  • 10. The smaller the surname ‘pool’ the greater the probability of isonymy.
  • 11.
  • 12. Lasker’s Distance Matrices 1881 Matrix 2001 Matrix Yarmouth Yeovil York Aberayron 6.389540 6.289929 6.438361 Aberdeen 6.356152 7.019357 6.213222 Abergavenny 6.412893 6.361753 6.566717 Aberystwith 6.327093 6.319481 6.467985 Abingdon 6.353814 6.559106 6.621873 95Z 99ZZ OOLN 00BL 7.520982 7.336616 7.219516 00BM 7.428889 7.315671 7.425037 00BN 7.347616 7.356772 7.394888 00BP 7.452982 7.299915 7.330886 00BQ 7.410027 7.300150 7.387787
  • 13. Regional identity in Britain
  • 14. Creating Regions: Ward’s Hierarchical Clustering 2001 1881
  • 15. Creating Regions: K-Means Clustering 2001
  • 16. Creating Regions: K-Means Clustering 1881
  • 17. Corby: A Scottish Town? 1881 2001 MDS Ward’s K-Means
  • 18. Corby: A Scottish Town? In 1932 Stewarts and Lloyds built a new iron and steel works in Corby. Labor sourced from closing Scottish steelworks, mainly in Lanarkshire. Into the 1970s, 50% of the incoming population Scottish. Transformed population from 1,500 to 34,000 . Annual Highland Games.
  • 20. Back to Audlem…Is it Welsh?
  • 21.
  • 22. Comparing the similarity of surname compositions across space creates a regional geography of surnames.
  • 23. With further work, it may be possible to use surname regions or clusters to better characterize and subdivide the British population for geodemographic analysis. jamescheshire.co.uk
  • 24. References Lasker Distance: Lasker, G. W. and C. G. N. Mascie-Taylor (2001). "The genetic structure of English villages: surname diversity changes between 1976 and 1997." Annals of Human Biology 28(5): 546-553. K-Means: Adnan, M., Singleton, A.D., Brunsdon, C., Longley, P.A. 2009. Moving to Real-Time Segmentation: Efficient Computation of Geodemographic Classification. GISRUK 2009. Surname Ethnicity Classification: Mateos, Webber and Longley (2007) The Cultural, Ethnic and Linguistic Classification of Populations and Neighbourhoods using Personal Names , CASA Working Paper 116, Centre for Advanced Spatial Analysis, University College London. Monmonier Algorithm: Manni, F., E. Guerard, et al. (2004). "Geographic Patterns of (Genetic, Morphologic, Linguistic) Variation: How Barriers Can Be Detected by Using Monmonier’s Algorithm." Human Biology 76(2): 173-190. KDE: Crimestat Workbook: http://www.icpsrdirect.org/CRIMESTAT/workbook/CrimeStat_III_Workbook_PowerPoint.ppt R Packages: Adegenet, cluster, maptools, rgl, sm, spdep , splancsfrom http://cran.r-project.org iL04_1.13 from http://www.let.rug.nl/~kleiweg/L04/ All boundary data from the maps Crown Copyright Ordnance Survey 2009.

Editor's Notes

  1. It is worth noting that the trends/ regions I am seeking to identify are best represented in “Anglo-Saxon” names or those with origins in Britain. Migrant names, although interesting, are included in the calculations but do not exert significant influence on regional characteristics. The exception is London in the 2001 data.
  2. Kernel Density Estimation maps to show the areas of highest frequency of a particular name in Britain. Two extremely common names at the top, two rarer names at the bottom.
  3. Lasker’s Coefficient of Isonymy is widely used for surname studies and extends the idea of monophyly (sharing a single common ancestor) between two populations. Measure explained as the probability of members of two populations or subpopulations having genes in common by descent as estimated from sharing the same surnames. No the intention of this talk to go into significant depth regarding this measure.
  4. Map of Ward’s Clustering, splitting Britain into 15 clusters. Despite the fact that spatial information regarding the geographical locations of the districts has not been included in the clustering and that there are no continuity constraints, the resulting regions at 15 clusters are surprisingly homogenous.
  5. The town of Corby is consistently clustered/ highlighted as a Scottish District in 2001, not a central England as would be expected given its location in Northamptonshire. This is not the case with the 1881 data, suggesting a Scottish migration into the area.
  6. This migration theory appears to be plausible.
  7. Finally, the town that voted to be Welsh. Do the surnames of its population get clustered into the Welsh group or an English one?
  8. Political motives, such as free prescriptions, rather than genealogical or cultural motives appear to be driving the locals to vote to be Welsh. It could of course also have been tongue in cheek!.