RGS Annual Conference Presentation


Published on

Published in: Technology, Economy & Finance
  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide
  • It is worth noting that the trends/ regions I am seeking to identify are best represented in “Anglo-Saxon” names or those with origins in Britain. Migrant names, although interesting, are included in the calculations but do not exert significant influence on regional characteristics. The exception is London in the 2001 data.
  • Kernel Density Estimation maps to show the areas of highest frequency of a particular name in Britain. Two extremely common names at the top, two rarer names at the bottom.
  • Lasker’s Coefficient of Isonymy is widely used for surname studies and extends the idea of monophyly (sharing a single common ancestor) between two populations. Measure explained as the probability of members of two populations or subpopulations having genes in common by descent as estimated from sharing the same surnames. No the intention of this talk to go into significant depth regarding this measure.
  • Map of Ward’s Clustering, splitting Britain into 15 clusters. Despite the fact that spatial information regarding the geographical locations of the districts has not been included in the clustering and that there are no continuity constraints, the resulting regions at 15 clusters are surprisingly homogenous.
  • The town of Corby is consistently clustered/ highlighted as a Scottish District in 2001, not a central England as would be expected given its location in Northamptonshire. This is not the case with the 1881 data, suggesting a Scottish migration into the area.
  • This migration theory appears to be plausible.
  • Finally, the town that voted to be Welsh. Do the surnames of its population get clustered into the Welsh group or an English one?
  • Political motives, such as free prescriptions, rather than genealogical or cultural motives appear to be driving the locals to vote to be Welsh. It could of course also have been tongue in cheek!.
  • RGS Annual Conference Presentation

    1. 1. Surnames: A Rich Source of Geodemographic Data <br />James Cheshire, Pablo Mateos<br />Department of Geography, University College London <br />Research Blog: jamescheshire.co.uk<br />Email: james.cheshire@ucl.ac.uk<br />
    2. 2. Names and Ethnicity<br />- Forenames and surnames can be classified into ethnic groupings.<br />- Already utilized within geodemographics (see Mateos et al., 2007).<br />
    3. 3. In Britain:<br />Cornish names<br />Welsh names<br />
    4. 4. Surnames and Regions<br /><ul><li>Many surnames originate from a specific area.
    5. 5. The highest frequency of these names still exists in their place of origin.
    6. 6. We can therefore expect areas to possess unique combinations of names.
    7. 7. We can also expect certain types of surname to occur more frequently in some areas rather than others.
    8. 8. This surname geography may reflect cultural characteristics and regional identities…</li></li></ul><li>Some Examples:<br />Smith<br />Lewis<br />Macleod<br />Buckley<br />
    9. 9. Creating Regions: Aggregating Surname Data<br /><ul><li> Isonymy: The occurrence of the same name in marriage.
    10. 10. The smaller the surname ‘pool’ the greater the probability of isonymy.
    11. 11. Geneticists developed the Coefficient of Isonymy to measure probability of isonymy between two populations.</li></ul>xand y: Districts<br />i: Surname<br />xiand yi: Freq. proportional to the xand y total popn.<br /><ul><li>The Coefficient of Isonymyhas been extended to a distance measure, the Lasker’s Distance, for comparison between populations. </li></ul>Lx,y= -loge2(Rx,y) <br />
    12. 12. Lasker’s Distance Matrices<br />1881 Matrix<br />2001 Matrix<br /> Yarmouth Yeovil York <br />Aberayron 6.389540 6.289929 6.438361<br /> Aberdeen 6.356152 7.019357 6.213222<br />Abergavenny 6.412893 6.361753 6.566717<br />Aberystwith 6.327093 6.319481 6.467985<br /> Abingdon 6.353814 6.559106 6.621873<br /> 95Z 99ZZ OOLN<br /> 00BL 7.520982 7.336616 7.219516<br /> 00BM 7.428889 7.315671 7.425037<br /> 00BN 7.347616 7.356772 7.394888<br /> 00BP 7.452982 7.299915 7.330886<br /> 00BQ 7.410027 7.300150 7.387787<br />
    13. 13. Regional<br /> identity in Britain<br />
    14. 14. Creating Regions: Ward’s Hierarchical Clustering <br />2001<br />1881<br />
    15. 15. Creating Regions: K-Means Clustering<br />2001<br />
    16. 16. Creating Regions: K-Means Clustering<br />1881<br />
    17. 17. Corby: A Scottish Town?<br />1881<br />2001<br />MDS<br />Ward’s<br />K-Means<br />
    18. 18. Corby: A Scottish Town?<br />In 1932 Stewarts and Lloyds built a new iron and steel works in Corby.<br />Labor sourced from closing Scottish steelworks, mainly in Lanarkshire.<br />Into the 1970s, 50% of the incoming population Scottish.<br />Transformed population from 1,500 to 34,000 .<br />Annual Highland Games.<br />
    19. 19. TheVillage of Audlem…Is it Welsh?<br />
    20. 20. Back to Audlem…Is it Welsh?<br />
    21. 21. Conclusions<br /><ul><li>Surnames are most common around their place of origin.
    22. 22. Comparing the similarity of surname compositions across space creates a regional geography of surnames.
    23. 23. With further work, it may be possible to use surname regions or clusters to better characterize and subdivide the British population for geodemographic analysis. </li></ul>jamescheshire.co.uk<br />
    24. 24. References<br />Lasker Distance:<br />Lasker, G. W. and C. G. N. Mascie-Taylor (2001). &quot;The genetic structure of English villages: surname diversity changes between 1976 and 1997.&quot; Annals of Human Biology 28(5): 546-553.<br />K-Means:<br />Adnan, M., Singleton, A.D., Brunsdon, C., Longley, P.A. 2009. Moving to Real-Time Segmentation: Efficient Computation of Geodemographic Classification. GISRUK 2009.<br />Surname Ethnicity Classification:<br />Mateos, Webber and Longley (2007) The Cultural, Ethnic and Linguistic Classification of Populations and Neighbourhoods using Personal Names , CASA Working Paper 116, Centre for Advanced Spatial Analysis, University College London.<br />Monmonier Algorithm:<br />Manni, F., E. Guerard, et al. (2004). &quot;Geographic Patterns of (Genetic, Morphologic, Linguistic) Variation: How Barriers Can Be Detected by Using Monmonier’s Algorithm.&quot; Human Biology 76(2): 173-190.<br />KDE:<br />Crimestat Workbook: http://www.icpsrdirect.org/CRIMESTAT/workbook/CrimeStat_III_Workbook_PowerPoint.ppt<br />R Packages:<br />Adegenet, cluster, maptools, rgl, sm, spdep , splancsfrom http://cran.r-project.org<br />iL04_1.13 from http://www.let.rug.nl/~kleiweg/L04/<br />All boundary data from the maps Crown Copyright Ordnance Survey 2009.<br />
    25. 25. jamescheshire.co.uk<br />