Data Mining to Understand International
Dimensions to Online Identity
- a classification of 2+ billion names and
their linkage to virtual identities and
social network traffic.




•   Alistair Leak
•   UCL SECReT
•   a.leak.11@ucl.ac.uk
Who am I?
Education:
Kingston University (BSc) - GIS
UCL (M.Res) - Advanced Spatial Analysis and Visualisation
UCL 3+1 - PhD Security and Crime Science


Supervisors:
1st Supervisor: Professor Paul Longley
2nd Supervisor: Dr James Cheshire
Definitions:
• Netnography
  – “A qualitative, interpretive research methodology that uses
    internet-optimized ethnographic research techniques to study the
    social context in online communities” (Kozinets, 2009)

• Cybergeodemographics
  – “The analysis of people by where they live and by whom they
    interact with, in real and virtual space” (Longley, 2012)
Uncertainty of Identity: Work Package 4:
    Cybergeodemographics
•   Use of primary and secondary data to relate virtual Internet traffic to the
    probable physical locations from which it emanated; and the development
    of typologies of social networks that are robust, generalized and related to
    physical locations.


    [Diagram: Data Collection Tools (WP1); Text Analytics (WP2); Secondary Data; Cybergeodemographics (WP4)]
Working Title:
• “Data Mining to Understand International Dimensions to
  Online Identity - a classification of 2+ billion names and
  their linkage to virtual identities and social network traffic”



  Objectives:
• Develop spatial context of name network classification
• Develop typologies of social networks
• Measure how representative social media is of the
  underlying population.
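
One way the representativeness objective could be sketched is as a ratio of shares: a group's share of tweets divided by its share of the underlying population. This is an illustrative assumption, not the measure used in the study; the function name `representation_ratio` and the group labels are hypothetical.

```python
def representation_ratio(tweet_counts, population_counts):
    """Return tweet share divided by population share for each group.

    A value above 1 suggests the group is over-represented in the tweet
    sample, below 1 under-represented. Groups with zero population, or an
    empty tweet sample, yield None. (Hypothetical helper for illustration.)
    """
    total_tweets = sum(tweet_counts.values())
    total_population = sum(population_counts.values())
    ratios = {}
    for group, population in population_counts.items():
        if population == 0 or total_tweets == 0:
            ratios[group] = None
            continue
        tweet_share = tweet_counts.get(group, 0) / total_tweets
        population_share = population / total_population
        ratios[group] = tweet_share / population_share
    return ratios


# Made-up counts, purely to show the calculation
example = representation_ratio({"A": 30, "B": 10}, {"A": 50, "B": 50})
```

With these made-up counts, group A holds 75% of tweets against 50% of the population, so its ratio is above 1.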
Work Plan
•   M.Res (Present – 2013)
     – Foundation work
         • Assess representative capability of tweet data
     – Skills Development
         • Spatio-Temporal Data Mining
         • Database Management


•   Ph.D (2013 – 2016)
     – Objectives
         • Develop spatial component of names networks
         • Develop typologies of social networks
         • Develop a measure of uncertainty
     – Completion in August 2016
Data Sources:




*Sina Weibo
Case Study: Tweets
   in London




• 1.4 Million Tweets
  over 3 months
  Sep - Dec 2012
What’s in a Tweet?

• First Name
• Surname
• Unique ID
• # Themes
• Location
• Popularity
• Interactions
• Time/Date

Possibilities:
• Political Affiliation
• Gender
• Age
• Location
Data Classification
• Gender
  – Database of 62,000 names + genders
  – Determined by forename
• Demographic
  – OAC – Output Area Classification
• ONOMAP
  – Ethnicity, Religion, Geographical Origin.
  – Determined by forename-surname combination
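
The forename-based gender step can be sketched as a dictionary lookup. This is a minimal sketch under stated assumptions: the three-entry table stands in for the 62,000-name database (whose contents are not shown in the slides), and the helper names `split_display_name` and `classify_gender` are hypothetical.

```python
# Hypothetical miniature lookup table standing in for the 62,000-name
# database mentioned on the slide; the entries are illustrative only.
FORENAME_GENDER = {
    "alice": "female",
    "maria": "female",
    "james": "male",
}


def split_display_name(display_name):
    """Split a free-text display name into (forename, surname), lower-cased.

    Assumes the first token is the forename and the last token the surname,
    which will not hold for every naming convention.
    """
    parts = display_name.strip().split()
    if not parts:
        return None, None
    forename = parts[0].lower()
    surname = parts[-1].lower() if len(parts) > 1 else None
    return forename, surname


def classify_gender(display_name, lookup=FORENAME_GENDER):
    """Return the gender recorded for the forename, or 'unknown'."""
    forename, _ = split_display_name(display_name)
    if forename is None:
        return "unknown"
    return lookup.get(forename, "unknown")
```

Names missing from the table fall through to "unknown" rather than being guessed, which keeps unclassified users visible in later counts.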
Data Classification
Tweets by ONOMAP Religion
Tweets by ONOMAP Group
Challenges of Study

• Signal from Noise
  – Tweets are not all sent from individuals' homes
     • Day and night demographics
  – Not all geolocated tweets come from real people
• Data Quality/Sample Size
  – Twitter users are self-selecting
     • Only a small proportion have enabled location services
     • Dataset currently has 92,000 unique users
Target Areas of Study

• Spatio-temporal differentiation of tweets
  – Night
  – Day
  – Travel
• Expansion of the Methodology for World Names
  – Initially into Europe.
• Application of new name datasets.
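
The night/day/travel differentiation could be sketched by binning each tweet's timestamp by hour. The hour boundaries below are assumptions chosen only for illustration (the slides do not define them), and `time_period` and `count_by_period` are hypothetical names.

```python
from collections import Counter
from datetime import datetime

# Assumed boundaries, not taken from the study:
# travel = 07:00-09:59 and 17:00-19:59, day = 10:00-16:59, night = otherwise.
TRAVEL_HOURS = set(range(7, 10)) | set(range(17, 20))
DAY_HOURS = set(range(10, 17))


def time_period(timestamp):
    """Label a datetime as 'travel', 'day' or 'night' by its hour."""
    hour = timestamp.hour
    if hour in TRAVEL_HOURS:
        return "travel"
    if hour in DAY_HOURS:
        return "day"
    return "night"


def count_by_period(timestamps):
    """Count how many timestamps fall into each period."""
    return dict(Counter(time_period(t) for t in timestamps))


# Made-up timestamps within the Sep - Dec 2012 collection window
sample = [
    datetime(2012, 10, 1, 8, 30),
    datetime(2012, 10, 1, 12, 0),
    datetime(2012, 10, 1, 23, 15),
]
period_counts = count_by_period(sample)
```

A fuller treatment would also separate weekdays from weekends, since commuting hours only apply to working days.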
References:
•   Dale, M. R. T., and M-J. Fortin. "From graphs to spatial graphs." Annual Review of Ecology,
    Evolution, and Systematics 41.1 (2010): 21.
•   Fischer, E. (July, 2011). World Map of Flickr and Twitter Locations. In See Something or Say
    Something. Available at http://www.flickr.com/photos/walkingsf/5912169471/in/set-72157627140310742
•   http://urbantick.blogspot.co.uk/2010/12/ncl-social-networks.html
•   Kozinets, Robert V. Netnography: Doing ethnographic research online. Sage Publications Limited,
    2009.
•   R Core Team (2012). R: A language and environment for statistical computing. R Foundation for
    Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL http://www.R-project.org/.
•   Rao, D., Yarowsky, D., Shreevats, A., & Gupta, M. (2010, October). Classifying latent user attributes
    in twitter. In Proceedings of the 2nd international workshop on Search and mining user-generated
    contents (pp. 37-44). ACM.
Thank-you




X Factor Graph
Produced with R and Gephi

Phd Colloquium Spatial Analysis
