- The document discusses using transactional data, like telephone call records, for social scientific research.
- Three case studies are presented that analyze telephone call data: one finds higher "localness" of calls correlates with greater deprivation, another finds weekly and fortnightly patterns in call durations, and a third attempts to classify households based on their calling patterns.
- The studies demonstrate how transactional data can provide new insights but also have limitations like non-random samples. Overall, the document argues transactional data is a promising new resource for sociology if used carefully.
La estimulación precoz se refiere a un conjunto de actividades diseñadas para estimular el desarrollo cognitivo, físico y psicológico de niños de 0 a 6 años. Se centra en áreas como el lenguaje, la motricidad, las habilidades socioemocionales y cognitivas. Los beneficios incluyen prevenir déficits en el desarrollo, fomentar la autonomía y brindar una base sólida para futuros aprendizajes. La estimulación precoz debe realizarse a través de juegos y de
El fascismo surgió en Italia después de la Primera Guerra Mundial como una ideología anti-democrática y anti-comunista que promovía un estado totalitario y nacionalista con un culto a la violencia. En Alemania, el Partido Nazi de Hitler llegó al poder en 1933 de forma legal y estableció un régimen totalitario racista basado en la supremacía de la raza aria y la expansión territorial militarista.
O documento lista os artilheiros e seus gols marcados em várias copas regionais no Rio Grande do Sul, incluindo a Copa Metropolitana, Copa Região Sul-Fronteira, e Copa Região Serrana. Lucas Silva marcou 4 gols pelo Aimoré e Hyantony marcou 11 gols pelo Passo Fundo, o artilheiro de suas respectivas copas. Vários jogadores marcaram 2 ou mais gols em suas equipes.
Regras para a realização de um trabalho escritoBibPFerreiro
Este documento fornece instruções detalhadas sobre como elaborar um trabalho escrito, incluindo como organizar o conteúdo, estruturar a apresentação e formatar o documento final. As principais seções devem incluir uma capa, índice, introdução, desenvolvimento, conclusão e bibliografia. O texto destaca a importância de seguir estas diretrizes para produzir um trabalho bem estruturado e profissional.
Talk given at University of Essex Sociology Dept's Graduate Conference 2011 exploring some of the data that provoked Savage & Burrows' 2007 paper in Sociology (soc.sagepub.com/content/41/5/885.abstract).
Social Communications in Time and SpaceBen Anderson
Presentation given at ESRC Research Centre on Micro-Social Change (http://www.iser.essex.ac.uk/research/misoc) update workshop. Uses call record data to analyse/predict multiple deprivation and also to explore temporal patterns in social communication behaviour.
The sociological value of transactional data?Ben Anderson
In this talk we will outline some of the potential sociological research value in transactional data and present for discussion some preliminary analysis of both internet usage log data from several Brazilian institutions and from UK households. We will also present some exploratory analysis of a large corpus of UK telephone call records from the late 1990s. In these cases we will offer few conclusions but rather hope to generate discussion of their potential value in sociological research and of the ethical dilemmas that surround their collection and (re)use.
La estimulación precoz se refiere a un conjunto de actividades diseñadas para estimular el desarrollo cognitivo, físico y psicológico de niños de 0 a 6 años. Se centra en áreas como el lenguaje, la motricidad, las habilidades socioemocionales y cognitivas. Los beneficios incluyen prevenir déficits en el desarrollo, fomentar la autonomía y brindar una base sólida para futuros aprendizajes. La estimulación precoz debe realizarse a través de juegos y de
El fascismo surgió en Italia después de la Primera Guerra Mundial como una ideología anti-democrática y anti-comunista que promovía un estado totalitario y nacionalista con un culto a la violencia. En Alemania, el Partido Nazi de Hitler llegó al poder en 1933 de forma legal y estableció un régimen totalitario racista basado en la supremacía de la raza aria y la expansión territorial militarista.
O documento lista os artilheiros e seus gols marcados em várias copas regionais no Rio Grande do Sul, incluindo a Copa Metropolitana, Copa Região Sul-Fronteira, e Copa Região Serrana. Lucas Silva marcou 4 gols pelo Aimoré e Hyantony marcou 11 gols pelo Passo Fundo, o artilheiro de suas respectivas copas. Vários jogadores marcaram 2 ou mais gols em suas equipes.
Regras para a realização de um trabalho escritoBibPFerreiro
Este documento fornece instruções detalhadas sobre como elaborar um trabalho escrito, incluindo como organizar o conteúdo, estruturar a apresentação e formatar o documento final. As principais seções devem incluir uma capa, índice, introdução, desenvolvimento, conclusão e bibliografia. O texto destaca a importância de seguir estas diretrizes para produzir um trabalho bem estruturado e profissional.
Talk given at University of Essex Sociology Dept's Graduate Conference 2011 exploring some of the data that provoked Savage & Burrows' 2007 paper in Sociology (soc.sagepub.com/content/41/5/885.abstract).
Social Communications in Time and SpaceBen Anderson
Presentation given at ESRC Research Centre on Micro-Social Change (http://www.iser.essex.ac.uk/research/misoc) update workshop. Uses call record data to analyse/predict multiple deprivation and also to explore temporal patterns in social communication behaviour.
The sociological value of transactional data?Ben Anderson
In this talk we will outline some of the potential sociological research value in transactional data and present for discussion some preliminary analysis of both internet usage log data from several Brazilian institutions and from UK households. We will also present some exploratory analysis of a large corpus of UK telephone call records from the late 1990s. In these cases we will offer few conclusions but rather hope to generate discussion of their potential value in sociological research and of the ethical dilemmas that surround their collection and (re)use.
Arnaldo Barreto & Ben Anderson - The sociological value of transactional data?Sociology@Essex
CRESI seminar by Arnaldo Barreto & Ben Anderson from Department of Sociology, University of Essex on 18 February 2010.
Abstract:
In this talk we will outline some of the potential sociological research value in transactional (‘radical’) data and present for discussion some preliminary analysis of both internet usage log data from several Brazilian institutions and from UK households. We will also present some exploratory analysis of a large corpus of UK telephone call records from the late 1990s. In these cases we will offer few conclusions but rather hope to generate discussion of their potential value in sociological research and of the ethical dilemmas that surround their collection and (re)use.
Presentation - Using open data to develop statistical literacy in schools - U...celiamac58
The data from Scotland’s Census 2011 provides a rich and detailed source of information for understanding Scotland and its population. The resource provides an excellent opportunity for educators and learners to explore Scotland’s past and to appreciate, as active citizens in a democratic society, how census records can be used to inform planning decisions that will shape Scotland’s future.
TCI 2015 Cluster Mapping: Pattern in Irish National and Regional Economic Act...TCI Network
This document summarizes a study analyzing patterns in Irish national and regional economic clusters using available data sources. Key findings include:
- Ireland has prominent traded clusters in biopharmaceuticals, IT, business services, and food processing concentrated in certain regions.
- Foreign-owned plants tend to have higher output, wages, and exports than Irish-owned plants within the same clusters.
- Some regions have higher wages due to stronger clusters (cluster level effect) while others rely more on mix of cluster industries (cluster mix effect).
- Data limitations hamper comprehensive cluster analysis, particularly for services, and better data could inform regional economic policy.
Linking Socio-economic and Demographic Characteristics to Twitter Topics - Gu...Guy Lansley
Social media data is now widely considered a viable source for market and social research. Everyday Twitter’s users generate large quantities of data through Tweet messages which express the users’ thoughts and opinions, and may also describe their activity, plans and location. In its raw form, textual data at this volume is hard to process and understand, however, it is possible to model the Tweets into a small number of topics using generative probabilistic algorithms. This paper aims to research how the content of Tweets may vary by socio-economics and demographic characteristics using Tweets from Inner London sourced from the Twitter application programming interface.
Earlier research has successfully allocated over 1 million geo-located Tweets from Inner London in 2013 into a hierarchical classification of 20 groups and 100 subgroups created using a latent dirichlet allocation algorithm. The 20 groups consist of distinctive topics and uses of language, and they all demonstrate unique spatial and temporal patterns across Inner London. The next stage of the analysis explores how the Twitter classification varies across the residential geography of Inner London. Assuming that most Tweets sourced from residential buildings are likely to be sourced by residents, the classification can be compared to socio-economic and other demographic characteristics from open data sources. In addition, some characteristics such as gender and ethnicity can also be inferred from the names of Twitter users.
Presentation - Using open data to develop statistical literacy in schools - c...celiamac58
This document provides an overview of how to use Scotland's Census data to develop statistical literacy in schools. It outlines how to access census findings through parliamentary profiles, topic summaries, and the statistics cycle. It demonstrates how to use the Census Data Explorer website to analyze area profiles and standard outputs. Users can examine household compositions and characteristics by area and age/sex through these tools.
Zones are geographic divisions used to aggregate statistical data and represent areas on maps. Zone design determines which statistical units like people and households are grouped together, and can impact observed relationships between variables. The configuration and scale of zones can affect election results and observed health correlations due to the modifiable areal unit problem. Researchers using zone-based data should understand how the zones were created and consider designing their own zones to suit their research objectives.
Monitoring Internal Migration in the United KingdomUKDSCensus
Internal migration in the UK saw modest declines in intensity from 2000-2001 to 2010-2011 according to census and administrative data, with the largest decreases for those aged 45-59. Inter-district migration distances also decreased slightly. When compared internationally, the UK's aggregate migration intensity was average. The document advocates for greater use of origin-destination census data, cross-national migration research, and consideration of scale and zonation effects in internal migration studies.
This document summarizes population trends in the UK and how population is measured and mapped from national to local scales. It discusses key trends like population growth, aging population, and geographical variation. It also describes how the census and other administrative data are used to measure and understand population changes and challenges in counting population in the future with fewer resources.
Presentation given at ESRC Sustainable Practices Research Group "Studying sustainable practices: methodological challenges and puzzles " Workshop Tuesday 22nd March, 2011, Lancaster.
Big Data in Economic Research: Twitter, Phone calls and Political eventsPhDSofiaUniversity
This document summarizes research using various types of big data including call detail records, political event data, and Twitter data. It discusses how call detail records have been used to study the spread of diseases, optimize transportation networks, and track population displacement. Political event data and mining data have been combined to examine how minerals fuel local conflicts in Africa. Twitter data has been analyzed to map language distributions in Europe and study international migration patterns, such as tracking the migration of Venezuelans during an economic crisis.
Seriously Mixed Methods - a GRIDy Challenge?Ben Anderson
Paper presented at the ESRC Research Methods Programme Seminar 'Combining data: using advanced technology to enhance social science resources ' , Dec 18th 2002, University of Manchester.
Sutton in partnership presentation residents survey and one planet sutton -...SuttoninPartnership
This document summarizes the results of the 2013 Sutton Residents' Survey conducted by Ipsos MORI. Key findings include:
- Overall satisfaction with the local area as a place to live remains high at 91%, on par with previous years. However, parking, dog fouling and speeding are the top local area problems.
- Nine in ten residents think their local area is cohesive and people from different backgrounds get along.
- Three in four residents are satisfied with how Sutton Council runs things, higher than many other London boroughs. However, less than half feel they can influence council services.
- Residents continue to feel safe in their neighborhood during the day and feelings of safety after dark are
This is my presentation to the 20th ITA telework workshop. It reviews telework developments since the first workshop in 1996, with particular focus on the UK.
SAGT Conference 2015 - Scottish Government Stats and teachingceliamac58
1) The Scottish Government produces statistics to provide an accurate picture of the economy and society and ensure integrity and impartiality.
2) Census data provides information on population, households, housing, health, ethnicity, education, and more. This data is available through profiles, visualizations, maps, and tables.
3) Comparing trends in household composition over time using Census data could provide insights into issues like increasing one-person households and implications for communities.
This document outlines a study that aims to examine the relationship between income levels and ecological footprint in a village and city in India. The study will survey 30-40 households in each area using a standardized questionnaire to collect data on income and factors contributing to ecological footprint. Random sampling will be used to select households. Data will be analyzed using graphs to observe how income impacts ecological footprint between the two areas. The results could help determine how income correlates with ecological footprint and environmental impact.
Employment leakage by local government are in the Northern Territory, Austral...Ninti_One
Blackwell, B, Fischer, A, McFarlane, J and Dollery B (in press). 'Employment leakage by Local Government Area in the Northern Territory, Australia: The roles of industries, including mining', Journal of Developing Areas (2015)
Introduction to Census data and practical applications - Geography Skills Abe...celiamac58
This course outlines the results from the 2011 Census in Scotland, and demonstrates a number of ways of accessing and using the published resources. The course will focus on the Scottish results published through the Census Data Explorer, and will use a number of case studies to illustrate how the data can be used to answer specific questions.
Statistics Sweden is responding to the Covid-19 pandemic by releasing tailored statistics and analyses to inform government agencies and the public. They are providing support to the Public Health Agency of Sweden through data enrichment and spatial analysis of microdata on infected individuals. This includes analyzing characteristics like socioeconomic status, household type, and country of origin. Statistics Sweden is also using mobile network data from Telia, in collaboration, to study changes in mobility and activity patterns during the pandemic.
Using Time Use Data To Trace 'Energy Practices' Through TimeBen Anderson
The document discusses trends in energy demand (DEMAND) over time based on time use survey data from 1974 to 2005. It analyzes 10 activity classes and shows changes over time, with declines in activities like travel, cooking and eating, and increases in media use. Food preparation trends are examined in more detail, showing peaks shifting from the evening to the late morning on weekends. The analysis of time use data provides insights into how energy-demanding activities and practices have changed over the past several decades.
Modeling Water Demand in Droughts (in England & Wales)Ben Anderson
This document describes an agent-based microsimulation model for estimating domestic water demand under drought conditions in the UK. The model simulates individual households and factors that influence water usage, such as household attributes, appliances, practices, pricing, and drought interventions. Preliminary results show that including drought responses can reduce total water demand by 5% compared to not including responses. Further development of the model will add more influencing factors and link it to drought forecasts to better estimate future water demand scenarios.
More Related Content
Similar to On the social scientific value of transactional data
Arnaldo Barreto & Ben Anderson - The sociological value of transactional data?Sociology@Essex
CRESI seminar by Arnaldo Barreto & Ben Anderson from Department of Sociology, University of Essex on 18 February 2010.
Abstract:
In this talk we will outline some of the potential sociological research value in transactional (‘radical’) data and present for discussion some preliminary analysis of both internet usage log data from several Brazilian institutions and from UK households. We will also present some exploratory analysis of a large corpus of UK telephone call records from the late 1990s. In these cases we will offer few conclusions but rather hope to generate discussion of their potential value in sociological research and of the ethical dilemmas that surround their collection and (re)use.
Presentation - Using open data to develop statistical literacy in schools - U...celiamac58
The data from Scotland’s Census 2011 provides a rich and detailed source of information for understanding Scotland and its population. The resource provides an excellent opportunity for educators and learners to explore Scotland’s past and to appreciate, as active citizens in a democratic society, how census records can be used to inform planning decisions that will shape Scotland’s future.
TCI 2015 Cluster Mapping: Pattern in Irish National and Regional Economic Act...TCI Network
This document summarizes a study analyzing patterns in Irish national and regional economic clusters using available data sources. Key findings include:
- Ireland has prominent traded clusters in biopharmaceuticals, IT, business services, and food processing concentrated in certain regions.
- Foreign-owned plants tend to have higher output, wages, and exports than Irish-owned plants within the same clusters.
- Some regions have higher wages due to stronger clusters (cluster level effect) while others rely more on mix of cluster industries (cluster mix effect).
- Data limitations hamper comprehensive cluster analysis, particularly for services, and better data could inform regional economic policy.
Linking Socio-economic and Demographic Characteristics to Twitter Topics - Gu...Guy Lansley
Social media data is now widely considered a viable source for market and social research. Everyday Twitter’s users generate large quantities of data through Tweet messages which express the users’ thoughts and opinions, and may also describe their activity, plans and location. In its raw form, textual data at this volume is hard to process and understand, however, it is possible to model the Tweets into a small number of topics using generative probabilistic algorithms. This paper aims to research how the content of Tweets may vary by socio-economics and demographic characteristics using Tweets from Inner London sourced from the Twitter application programming interface.
Earlier research has successfully allocated over 1 million geo-located Tweets from Inner London in 2013 into a hierarchical classification of 20 groups and 100 subgroups created using a latent dirichlet allocation algorithm. The 20 groups consist of distinctive topics and uses of language, and they all demonstrate unique spatial and temporal patterns across Inner London. The next stage of the analysis explores how the Twitter classification varies across the residential geography of Inner London. Assuming that most Tweets sourced from residential buildings are likely to be sourced by residents, the classification can be compared to socio-economic and other demographic characteristics from open data sources. In addition, some characteristics such as gender and ethnicity can also be inferred from the names of Twitter users.
Presentation - Using open data to develop statistical literacy in schools - c...celiamac58
This document provides an overview of how to use Scotland's Census data to develop statistical literacy in schools. It outlines how to access census findings through parliamentary profiles, topic summaries, and the statistics cycle. It demonstrates how to use the Census Data Explorer website to analyze area profiles and standard outputs. Users can examine household compositions and characteristics by area and age/sex through these tools.
Zones are geographic divisions used to aggregate statistical data and represent areas on maps. Zone design determines which statistical units like people and households are grouped together, and can impact observed relationships between variables. The configuration and scale of zones can affect election results and observed health correlations due to the modifiable areal unit problem. Researchers using zone-based data should understand how the zones were created and consider designing their own zones to suit their research objectives.
Monitoring Internal Migration in the United KingdomUKDSCensus
Internal migration in the UK saw modest declines in intensity from 2000-2001 to 2010-2011 according to census and administrative data, with the largest decreases for those aged 45-59. Inter-district migration distances also decreased slightly. When compared internationally, the UK's aggregate migration intensity was average. The document advocates for greater use of origin-destination census data, cross-national migration research, and consideration of scale and zonation effects in internal migration studies.
This document summarizes population trends in the UK and how population is measured and mapped from national to local scales. It discusses key trends like population growth, aging population, and geographical variation. It also describes how the census and other administrative data are used to measure and understand population changes and challenges in counting population in the future with fewer resources.
Presentation given at ESRC Sustainable Practices Research Group "Studying sustainable practices: methodological challenges and puzzles " Workshop Tuesday 22nd March, 2011, Lancaster.
Big Data in Economic Research: Twitter, Phone calls and Political eventsPhDSofiaUniversity
This document summarizes research using various types of big data including call detail records, political event data, and Twitter data. It discusses how call detail records have been used to study the spread of diseases, optimize transportation networks, and track population displacement. Political event data and mining data have been combined to examine how minerals fuel local conflicts in Africa. Twitter data has been analyzed to map language distributions in Europe and study international migration patterns, such as tracking the migration of Venezuelans during an economic crisis.
Seriously Mixed Methods - a GRIDy Challenge?Ben Anderson
Paper presented at the ESRC Research Methods Programme Seminar 'Combining data: using advanced technology to enhance social science resources ' , Dec 18th 2002, University of Manchester.
Sutton in partnership presentation residents survey and one planet sutton -...SuttoninPartnership
This document summarizes the results of the 2013 Sutton Residents' Survey conducted by Ipsos MORI. Key findings include:
- Overall satisfaction with the local area as a place to live remains high at 91%, on par with previous years. However, parking, dog fouling and speeding are the top local area problems.
- Nine in ten residents think their local area is cohesive and people from different backgrounds get along.
- Three in four residents are satisfied with how Sutton Council runs things, higher than many other London boroughs. However, less than half feel they can influence council services.
- Residents continue to feel safe in their neighborhood during the day and feelings of safety after dark are
This is my presentation to the 20th ITA telework workshop. It reviews telework developments since the first workshop in 1996, with particular focus on the UK.
SAGT Conference 2015 - Scottish Government Stats and teachingceliamac58
1) The Scottish Government produces statistics to provide an accurate picture of the economy and society and ensure integrity and impartiality.
2) Census data provides information on population, households, housing, health, ethnicity, education, and more. This data is available through profiles, visualizations, maps, and tables.
3) Comparing trends in household composition over time using Census data could provide insights into issues like increasing one-person households and implications for communities.
This document outlines a study that aims to examine the relationship between income levels and ecological footprint in a village and city in India. The study will survey 30-40 households in each area using a standardized questionnaire to collect data on income and factors contributing to ecological footprint. Random sampling will be used to select households. Data will be analyzed using graphs to observe how income impacts ecological footprint between the two areas. The results could help determine how income correlates with ecological footprint and environmental impact.
Employment leakage by local government are in the Northern Territory, Austral...Ninti_One
Blackwell, B, Fischer, A, McFarlane, J and Dollery B (in press). 'Employment leakage by Local Government Area in the Northern Territory, Australia: The roles of industries, including mining', Journal of Developing Areas (2015)
Introduction to Census data and practical applications - Geography Skills Abe...celiamac58
This course outlines the results from the 2011 Census in Scotland, and demonstrates a number of ways of accessing and using the published resources. The course will focus on the Scottish results published through the Census Data Explorer, and will use a number of case studies to illustrate how the data can be used to answer specific questions.
Statistics Sweden is responding to the Covid-19 pandemic by releasing tailored statistics and analyses to inform government agencies and the public. They are providing support to the Public Health Agency of Sweden through data enrichment and spatial analysis of microdata on infected individuals. This includes analyzing characteristics like socioeconomic status, household type, and country of origin. Statistics Sweden is also using mobile network data from Telia, in collaboration, to study changes in mobility and activity patterns during the pandemic.
Using Time Use Data To Trace 'Energy Practices' Through TimeBen Anderson
The document discusses trends in energy demand (DEMAND) over time based on time use survey data from 1974 to 2005. It analyzes 10 activity classes and shows changes over time, with declines in activities like travel, cooking and eating, and increases in media use. Food preparation trends are examined in more detail, showing peaks shifting from the evening to the late morning on weekends. The analysis of time use data provides insights into how energy-demanding activities and practices have changed over the past several decades.
Modeling Water Demand in Droughts (in England & Wales)Ben Anderson
This document describes an agent-based microsimulation model for estimating domestic water demand under drought conditions in the UK. The model simulates individual households and factors that influence water usage, such as household attributes, appliances, practices, pricing, and drought interventions. Preliminary results show that including drought responses can reduce total water demand by 5% compared to not including responses. Further development of the model will add more influencing factors and link it to drought forecasts to better estimate future water demand scenarios.
A Social Practices-based Microsimulation Model for Estimating Domestic Hot Wa...Ben Anderson
This document describes an IMPETUS microsimulation model developed to estimate domestic hot water demand in the UK. The model takes into account factors like weather, demographics, technology changes, and personal water usage behaviors. It aims to better address uncertainties around climate change, social practices, and responses to water and energy efficiency measures. Heating water accounts for over half of residential water usage in the UK and is the second largest energy use in homes. The model suggests that installing efficient showerheads can substantially reduce both water consumption and energy demand. Future work will assess the impacts of other water efficiency interventions and behavior changes on energy usage.
This document discusses a project called Solent Achieving Value from Efficiency (SAVE) that aims to reduce energy usage through LED lighting upgrades and monitor the financial and energy savings. It will conduct a trial evaluation including surveys and collecting energy usage data to analyze the results and share updates.
SAVE: A large scale randomised control trial approach to testing domestic ele...Ben Anderson
The document describes a study that tested demand response interventions in the UK using a large randomized controlled trial approach. Over 4,000 households were recruited and randomly assigned to control and intervention groups. Initial results found that messages encouraging shifting electricity use away from peak hours had little impact, while the addition of a financial incentive reduced consumption by up to 5% in the targeted hours. The study is using high frequency electricity consumption data and modeling techniques to analyze flexibility at a local level, which could help target interventions and inform network investment decisions.
Hunting for (energy) demanding practices using big & medium sized dataBen Anderson
Dr. Ben Anderson presents on hunting for energy demanding practices using big and medium sized data. He discusses linking transactional energy use data to surveys and time use studies to identify household practices that influence demand, such as cooking and cleaning routines. New sources of data like smart meter traces are also explored for revealing rhythms of demand. Challenges include scaling up micro-level insights and taking post-disciplinary approaches to analyze data of different sizes.
Electricity consumption and household characteristics: Implications for censu...Ben Anderson
Presentation given at MRS Workshop "Can Big Data replace the Census? What does Big Data give us now?" , March 7, 2016, MRS, London (https://www.mrs.org.uk/event/conferences/can_big_data_replace_the_census/course/4088/id/10035)
Small Area Estimation as a tool for thinking about temporal and spatial varia...Ben Anderson
Anderson, B (2014) Small Area Estimation as a tool for thinking about temporal and spatial variation in energy demand. Paper presented at AURIN/NATSEM Microsimulation Workshop, University of Melbourne, Thursday 4th December 2014
The Time and Timing of UK Domestic Energy DEMANDBen Anderson
Anderson, B. (2014) The Time and Timing of UK Domestic Energy DEMAND. Keynote paper presented at the 2014 Otago Energy Research Centre Symposium, University of Otago, Dunedin, New Zealand, 28/11/2014.
PRACTICE HUNTING: Time Use Surveys for a quantification of practices distribu...Ben Anderson
Mathieu Durand-Daubin (EDF R&D-ECLEER)
Ben Anderson (Southampton University)
Paper presented at BEHAVE 2014, Said Business School, Oxford, 3rd September 2014
Census2022: Extracting value from domestic consumption data in a postcensus eraBen Anderson
This document discusses a project called Census2022 that aims to extract value from domestic consumption data from smart meters in a post-census era. It details how smart meter data at high temporal resolution could be aggregated to small geographic areas to generate household statistics and indicators. The document then describes a study conducted with smart meter-like household electricity consumption data from 180 UK homes. Preliminary analysis of load profile indicators showed differences between households of varying sizes and employment statuses. However, more complex models are needed to better predict household characteristics from electricity use alone. Future steps involve accessing larger datasets and creating novel energy-based social indicators.
The Rhythms and Components of ‘Peak Energy’ DemandBen Anderson
Ben Anderson – University of Southampton (@dataknut)
Jacopo Torriti – University of Reading
Richard Hanna – University of Reading
Paper presented at BEHAVE 2014, Said Business School, Oxford, 3rd September 2014.
Tracking Social Practices with Big(ish) dataBen Anderson
Paper presented at 'Methodology' session of PRACTICES, THE BUILT ENVIRONMENT AND SUSTAINABILITY EARLY CAREER RESEARCHER NETWORK Workshop,
26-27 June 2014, Cambridge
Do ‘eco’ attitudes & behaviours explain the uptake of domestic energy product...Ben Anderson
"Do ‘eco’ attitudes & behaviours explain the uptake of domestic energy production technologies?"
Paper presented at "What Makes Us Act Green?", June 25 2014, London
Small Area Estimation as a tool for thinking about spatial variation in energ...Ben Anderson
Paper presented at "Spatial Variation in Energy Use, Attitudes and Behaviours: Implications for Smart Grids and Energy Demand", Policy Studies Institute, Friday, 7 February 2014, London, United Kingdom
The Distribution of Domestic Energy-Tech in Great Britain: 2008 – 2011Ben Anderson
This document summarizes research on the distribution of domestic energy technologies in Great Britain from 2008 to 2011. It finds that adopters of these technologies tend to be home-owning residents of detached, rural homes who report environmentally friendly views and actions. Those seriously considering adoption also tend to have higher energy spending. Rejecters of these technologies are more often urban renters living in non-detached housing who report less environmentally friendly attitudes. The research finds some differences between adopters of solar PV and solar thermal technologies and highlights equity issues in the uneven distribution of benefits from these technologies.
Thinking of getting a dog? Be aware that breeds like Pit Bulls, Rottweilers, and German Shepherds can be loyal and dangerous. Proper training and socialization are crucial to preventing aggressive behaviors. Ensure safety by understanding their needs and always supervising interactions. Stay safe, and enjoy your furry friends!
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
Assessment and Planning in Educational technology.pptxKavitha Krishnan
In an education system, it is understood that assessment is only for the students, but on the other hand, the Assessment of teachers is also an important aspect of the education system that ensures teachers are providing high-quality instruction to students. The assessment process can be used to provide feedback and support for professional development, to inform decisions about teacher retention or promotion, or to evaluate teacher effectiveness for accountability purposes.
Main Java[All of the Base Concepts}.docxadhitya5119
This is part 1 of my Java Learning Journey. This Contains Custom methods, classes, constructors, packages, multithreading , try- catch block, finally block and more.
Executive Directors Chat Leveraging AI for Diversity, Equity, and InclusionTechSoup
Let’s explore the intersection of technology and equity in the final session of our DEI series. Discover how AI tools, like ChatGPT, can be used to support and enhance your nonprofit's DEI initiatives. Participants will gain insights into practical AI applications and get tips for leveraging technology to advance their DEI goals.
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...PECB
Denis is a dynamic and results-driven Chief Information Officer (CIO) with a distinguished career spanning information systems analysis and technical project management. With a proven track record of spearheading the design and delivery of cutting-edge Information Management solutions, he has consistently elevated business operations, streamlined reporting functions, and maximized process efficiency.
Certified as an ISO/IEC 27001: Information Security Management Systems (ISMS) Lead Implementer, Data Protection Officer, and Cyber Risks Analyst, Denis brings a heightened focus on data security, privacy, and cyber resilience to every endeavor.
His expertise extends across a diverse spectrum of reporting, database, and web development applications, underpinned by an exceptional grasp of data storage and virtualization technologies. His proficiency in application testing, database administration, and data cleansing ensures seamless execution of complex projects.
What sets Denis apart is his comprehensive understanding of Business and Systems Analysis technologies, honed through involvement in all phases of the Software Development Lifecycle (SDLC). From meticulous requirements gathering to precise analysis, innovative design, rigorous development, thorough testing, and successful implementation, he has consistently delivered exceptional results.
Throughout his career, he has taken on multifaceted roles, from leading technical project management teams to owning solutions that drive operational excellence. His conscientious and proactive approach is unwavering, whether he is working independently or collaboratively within a team. His ability to connect with colleagues on a personal level underscores his commitment to fostering a harmonious and productive workplace environment.
Date: May 29, 2024
Tags: Information Security, ISO/IEC 27001, ISO/IEC 42001, Artificial Intelligence, GDPR
-------------------------------------------------------------------------------
Find out more about ISO training and certification services
Training: ISO/IEC 27001 Information Security Management System - EN | PECB
ISO/IEC 42001 Artificial Intelligence Management System - EN | PECB
General Data Protection Regulation (GDPR) - Training Courses - EN | PECB
Webinars: https://pecb.com/webinars
Article: https://pecb.com/article
-------------------------------------------------------------------------------
For more information about PECB:
Website: https://pecb.com/
LinkedIn: https://www.linkedin.com/company/pecb/
Facebook: https://www.facebook.com/PECBInternational/
Slideshare: http://www.slideshare.net/PECBCERTIFICATION
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Macroeconomics- Movie Location
This will be used as part of your Personal Professional Portfolio once graded.
Objective:
Prepare a presentation or a paper using research, basic comparative analysis, data organization and application of economic information. You will make an informed assessment of an economic climate outside of the United States to accomplish an entertainment industry objective.
The simplified electron and muon model, Oscillating Spacetime: The Foundation...RitikBhardwaj56
Discover the Simplified Electron and Muon Model: A New Wave-Based Approach to Understanding Particles delves into a groundbreaking theory that presents electrons and muons as rotating soliton waves within oscillating spacetime. Geared towards students, researchers, and science buffs, this book breaks down complex ideas into simple explanations. It covers topics such as electron waves, temporal dynamics, and the implications of this model on particle physics. With clear illustrations and easy-to-follow explanations, readers will gain a new outlook on the universe's fundamental nature.
This slide is special for master students (MIBS & MIFB) in UUM. Also useful for readers who are interested in the topic of contemporary Islamic banking.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
On the social scientific value of transactional data
1. On the social scientific value of
transactional data
Ben Anderson* & Alexei Vernitski†
*
Department of Sociology
†
Department of Mathematical Sciences
26 May 2011
2. Outline
What do we mean?
A 21st Century Sociology?
Case studies:
• Do deprived areas have different telephone calling patterns?
• Do households have similar calling patterns?
• Can we usefully classify households' calling patterns?
Future directions
3. What do we mean?
Transactional data:
• Generated by everyday life
• Automatically captured as part of 'business as usual'
• N = millions
• Billions of data points
Literature commentary:
• Surveillance, Computer Science
4. What do we mean?
Transactional data:
• Generated by everyday life
• Automatically captured as part of 'business as usual'
• N = millions
• Billions of data points
Literature commentary:
• Surveillance, Computer Science
• Social Science
• Savage & Burrows, 2007
• doi:10.1177/0038038507080443
• 101 citations (Google Scholar)
• http://www.youtube.com/watch?v=ARLARDwLJhw
6. Traditional uses
“PayCheck profiles all 1.6 million postcodes in the UK
using information on over 4 million households from
lifestyle surveys and Census and Market Research data.
It is available as a mean, median and mode figure for
each postcode or as a PayCheck type.
PayCheck can be used for:
• Selecting names and addresses from the Ocean
• Coding up customer records for profiling or campaign
selections
• Profiling to understand how your customer group
compares to the rest of the UK population”
“Mosaic UK classifies consumers by household
or postcode, allowing you to optimise the use of the
segmentation depending upon the application.
• 46% of the data used to build Mosaic is nonCensus sourced information that is updated
annually. This enables Mosaic to monitor changes
in consumer behaviour and incorporate these each
year within the classification.
• Mosaic UK is validated by a comprehensive
programme of fieldwork and observational
research covering each of the UK's 120 postal
areas.”
8. A 21st Century Sociology?
• Re-assessing old questions
• Networks, place, space & social relationships (capital)
• Consumption, leisure and class?
• Public performance of self?
• Imagining new questions?
• Software & social stratification?
• ?
9. A 21st Century Sociology?
• Re-assessing old questions
• Networks, place, space & social relationships (capital)
• Consumption, leisure and class?
• Public performance of self?
• Imagining new questions?
• Software & social stratification?
• ?
• What might our students need to know?
• Data provenance & politics?
• Imputation, visualisation, signal analysis…?
• An Industrial Sociology?
10. Case study data
•
BT 100,000 data
•
•
•
sample of 103,113 households covering all of the UK in 1995
All outgoing billable calls recorded for the months of October 1995,
March 1996, October 1996, March 1997, October 1997, March 1998
Linked to Customer data – postcode, billing flags, ACORN code
11. Case study data
•
BT 100,000 data
•
•
•
•
sample of 103,113 households covering all of the UK in 1995
All outgoing billable calls recorded for the months of October 1995,
March 1996, October 1996, March 1997, October 1997, March 1998
Linked to Customer data – postcode, billing flags, ACORN code
BT/Essex Home OnLine household panel data
•
•
•
representative sample of 1000 households covering all of GB in 1998
3 wave household panel survey (1998/1999/2000)
All incoming & outgoing billable calls recorded for 438 of the 1000
households who:
a)
b)
were BT customers and
gave consent for call records to be linked to survey
12. Data: BT 100,000
•
•
October 1995 +
October/March 1996/7/8
A sample, not complete
coverage
England Northern I reland S cotland
In first sample (Oct 95) only
In first & second samples only
…
In all samples
70,195
10,305
11,485
Total
70,195
10,305
11,485
W ales
11,131
11,131
Unknow n ( phone Unknow n
number
( postcode
unmatched)
unmatched)
1,606
2,245
1,149
11,245
Useful longitudinal sample = 103,116
261
261
Total
1,606
2,245
104,526
114,622
13. Data: BT 100,000 sample
Aggregated to
OAs
Datazones
LSOA (Eng & Wales)
SOA (NI)
Datazone (Scotland)
14. Data: BT 100,000 sample
•
•
•
October 1995 +
October/March 1996/7/8
A sample, not complete
coverage
So is it a representative
sample?
15. Data: BT 100,000 sample
•
October 1995 +
October/March 1996/7/8
A sample, not complete
coverage
•
16.0%
14.0%
•
So is it a representative
sample?
12.0%
% of sampled households
(unfiltered)
10.0%
% of sampled households
(filtered)
8.0%
% of all households
6.0%
•
Sample tends to have fewer at
the margins, especially in the
most deprived areas.
4.0%
2.0%
0.0%
0
1
2
3
4
5
6
7
8
9
IMD 2004 decile
•
•
•
•
IMD = Index of Multi[ple Deprivation (area level Government statistics)
England data only (n = 32,482 LSOAs of which 1,743 contained at least one sampled number and 593 contained over 33 or 7.5%)
70,195 households/numbers (unfiltered), 51,118 (filtered)
Filter removes areas where number of households logged > 30
16. Data: BT ‘100,000’ sample selection bias?
• Areas where calls collected more likely to have…
• Slightly higher % of households in higher socio-economic groups
• Lower % of households with 2+ cars
• Live in lower IMD employment deprivation scores
• West Midlands & Yorkshire location
• Implications:
• Generalisability?
• Ideally we’d want a random sample
But…
•
•
•
England data only (n = 32,482 LSOAs of which 1,743 contained at least one sampled number and 593 contained over 33 or 7.5%)
70,195 households/numbers (unfiltered), 51,118 (filtered)
Filter removes areas where number of households logged > 30
17. Data: Home OnLine Household Panel
•
•
June 1998 - July 2001, 999 households
A representative sample by design
partner, kids > 15
partner, kids > 11
partner, kids < 12
partner , no kids, > 55
partner , no kids, < 56
couple, no kids, aged < 36
lone parent , all kids > 15
lone parent , all kids < 16
un-related others
alone over 55
alone under 56
0
5
10
15
% of households
Un-monitored
Monitored
20
•
A total of 438 households were
monitored at some point
18. Outline
What do we mean?
A 21st Century Sociology?
Case studies:
• Do deprived areas have different telephone calling patterns?
• Do households have similar calling patterns?
• Can we usefully classify households' calling patterns?
Future directions
19. Case study 1: Deprivation & Communication
Network Diversity and Economic
Development
Nathan Eagle, et al. Science 328,
1029 (2010); DOI:
10.1126/science.1186605
20. Case study 1: Deprivation & Communication
•
Our approach:
•
Simplified ‘localness’ metric:
–
–
•
Ratio of local to national calls (ratio = n local† / n national)
Higher localness -> lower economic opportunity?
BT 100,000 data
–
•
March 1998 data to match IMD 2004 data
Local call = same dialing code area; Regional calls ~= Government Office Region (GOR); National calls ~= between GORs. Not
geographic distance.
†
21. Case Study 1: ‘Localness’ and ‘deprivation’
•
•
•
•
•
•
•
IMD 2004
% calls
Filtered
Distance
wasn’t dead in
1998
LSOA level
March 1998, England data only (1,743 LSOAs contained at least one sampled number and 593 contained over 33
or 7.5%). 1,743 LSOAs = 5M calls, 593 LSOAs = 3.7M calls
IMD 2004 (2001 data)
22. Case Study 1: ‘Localness’ by region
•
•
•
Household level, highest 10% of ‘Localness’ (outliers) removed - predominantly in Wales & Northern Ireland
March 1998, All = 78,127 ‘households’ (11,737 made no local calls and 30,894 made no national calls), England =
50,971 ‘households’
IMD 2004 (2001 data)
23. Case Study 1: ‘Localness’ and deprivation
•
•
•
Household level, highest 10% of ‘Localness’ (outliers) removed - predominantly in Wales & Northern Ireland
March 1998, All = 78,127 ‘households’ (11,737 made no local calls and 30,894 made no national calls), England =
50,971 ‘households’
IMD 2004 (2001 data)
24. Case Study 1: ‘Localness’ and ‘employment deprivation’:
•
•
‘Localness’
Ratio of
national to
local calls (=
local*/national)
•
IMD 2004
– Employment score
– R = 0.53
•
•
•
•
Household level, highest 10% of ‘Localness’ (outliers) removed - predominantly in Wales & Northern Ireland
March 1998, All = 78,127 ‘households’ (11,737 made no local calls and 30,894 made no national calls), England = 50,971 ‘households’
IMD 2004 (2001 data)
*Local call = same dialing code area; Regional calls ~= between GORs. Not geographic distance.
25. Case study 1: Deprivation and communication
• Probably!
• Localness measure
• For
both:
• Higher rates correlate (+) with deprivation
• Especially employment deprivation
26. Outline
What do we mean?
A 21st Century Sociology?
Case studies:
• Do deprived areas have different telephone calling patterns?
• Do households have similar calling patterns?
• Can we usefully classify households' calling patterns?
Future directions
27. Case study 2: Patterns of communication
• Why?
• Habits & rhythms of life…
• What’s all this about?
BT “100,000” data – sample of 103,113 households
October 1995 call data (7,935,195 calls)
28. Case study 2: Patterns of communication
BT “100,000” data – sample of 103,113 households covering all of the UK in 1995
Local calls = n/10
October 1995 data (7,935,195 calls)
29. Case study 2: Patterns of communication
BT “Home OnLine Panel” households
6
•
500
Sundays:
•
Longer calls
•
Fewer calls
•
But what’s this?
•
A data blip!
450
5
300
3
250
200
2
Mean call duration
350
4
150
100
1
50
5-Apr
29-Mar
22-Mar
15-Mar
6-Mar
1-Mar
22-Feb
15-Feb
8-Feb
1-Feb
25-Jan
18-Jan
0
11-Jan
0
4-Jan
Mean calls per household
400
Day
Mean number of calls per household
•
•
Mean call duration (s)
BT “Home OnLine Panel” data (423 households) - slightly more likely to be ‘alone over 55’ and living in a semi-detached
house/bungalow
January - April 2000
30. Case study 2: Patterns of communication
Duration of calls…
•
Mean call duration (s)
1.0
Autocorrelation analysis:
• Every 7th day is similar
•
•
.5
Every 14th day is similar
•
Partial ACF
0.0
-.5
1
3
5
7
9
11
13
15
17
19
21
23
25
27
29
31
33
35
-1.0
Confidence Limits
Coefficient
Lag Number
BT “Home Online Panel” data – 1999/2000 excerpt
•
weekly autocorrelation
fortnightly autocorrelation
even after allowing for
weekly autocorrelation
Humans are creatures of
(localised) habit!
31. Case study 2: Patterns of communication
• Biological networks:
• Many nodes with few connections &
few nodes with many connections: y =
a/x or y = a/x2 (log-linear or ‘scalefree’)
• Human calling networks:
• Same principle
• McCarty et al (2001) Comparing two
methods for estimating network size,
Human Organization; Spring 2001;
60, 1; pg. 28
• Do we find the same thing?
32. Case study 2: Patterns of communication
• Outgoing calls
8
• X: log (number of calls)
• Y: log (number of telephone
numbers)
7
6
5
4
400
3
350
2
300
250
200
y = -0.9494x +
7.2
1
0
R2 = 0.9289
0
150
2
• Informally:
100
50
0
0
•
•
1
100
200
300
400
500
600
700
800
900
BT “Home Online Panel” data – 1999/2000 excerpt
Data for 1 ‘typical’ household
3
4
5
6
7
8
• 1/2 the numbers called are
called once
• Of the rest 1/3 are called twice
• Of the remainder 1/4 are called
three times
• etc
33. Case study 2: Patterns of communication
• Outgoing calls and Incoming calls show roughly the
same pattern
•
•
BT “Home Online Panel” data – 1999/2000 excerpt
Data for 1 ‘typical’ household
34. Outline
What do we mean?
A 21st Century Sociology?
Case studies:
• Do deprived areas have different telephone calling patterns?
• Do households have similar calling patterns?
• Can we usefully classify households' calling patterns?
Future directions
35. Case study 3: Classifying households
• Incoming
• Outgoing
• ‘Mirror’ curves for two households
•
•
BT “Home Online Panel” data – 1999/2000 excerpt
Data for 1 ‘typical’ household
36. Case study 3: Classifying households
• ‘Mirror’ curves
for several
households
4045
4.5
4
3.5
3
2.5
2
Series1
1.5
1
0.5
0
(4
•
BT “Home Online Panel” data – 1999/2000 excerpt
(3
(2
(1
0
1
37. Case study 3: Classifying households
• Can we use these curves to classify households?
• Of the 391 who match to wave 1 survey data, 16 are ‘unusual
shapes’
• E.g. ‘4045’
4045
•
•
•
•
•
•
•
•
•
•
•
Single person
White male
Aged 46
Lives alone, divorced
Speaks to neighbours at least once a week
0 local friends
4 non local friends
2 local relatives
12 non-local relatives
'4
BT “Home Online Panel” data – 1999/2000 excerpt
Data for 1 ‘typical’ household
'3
5
4
3
Series1
2
1
0
'2
'1
0
1
38. Case study 3: Classifying households
• Of the 16 ‘unusual shapes’
• Predominantly single adults
• Alone > 55
• Alone < 55
• Lone parent with young
child(ren)
• Why?
partner+kid>15
partner+kid>11
partner+kid<12
part,nokid,>55
part,nokid,<56
part,nokid,<36
lonepar,kid>15
lonepar,kid<16
un-other rel
alone over 55
alone under 56
0
5
10
15
20
% of households
"Normal"
•
•
BT “Home Online Panel” data – 1999/2000 excerpt
Data for 1 ‘typical’ household
"Unusual"
25
30
39. Outline
What do we mean?
A 21st Century Sociology?
Case studies:
• Do deprived areas have different telephone calling patterns?
• Do households have similar calling patterns?
• Can we usefully classify households' calling patterns?
Future directions
40. Future directions:
• Analysis of call duration
• Do we get the same shape?
• How do we combine
frequency & duration?
• Can the curves predict
anything about the
households?
42. Future directions:
• Network characteristics in places
• Do people in different kinds of places have different kinds of (ego)
networks
• Does this change over time?
• Can we represent a place by some combination of ego networks?
• To what extent do ego networks overlap
• And how do we deal with the sampling issue?
43. Future directions:
• Network characteristics in places
• Do people in different kinds of places have different kinds of (ego)
networks
• Does this change over time?
• Can we represent a place by some combination of ego networks?
• To what extent do ego networks overlap
• And how do we deal with the sampling issue?
44. BUT: Data Issues
• Business calls?
• Some of the (longer) calls may be to ISPs
• Filter?
• Some things don’t match up:
• 36 of the Home OnLine panel households monitored were not BT customers
(or so they said in the survey!)
• Of the 136,732 different callers, 17,947 had no recorded number
•
•
•
Of the 124,032 different callees 1,963 had no recorded number and 21,295 had a
number 3 digits long (54% of these were calls to 192, 13% to ‘123’)
There are many semi-duplicates (impossible calls) - internal BT data?
And there are call type codes no-one understands any more!
> When you didn’t design the data source…
•
•
Provenance is crucial!
Data forensics skills are critical!
45. Thank you!
• Dr Ben Anderson
• benander@essex.ac.uk
• http://cresi.essex.ac.uk/getperson?personID=1
• Dr Alexei Verntiski
• asvern@essex.ac.uk
• http://www.essex.ac.uk/maths/staff/profile.aspx?ID=1275
• BT funded feasibility project:
• http://cresi.essex.ac.uk/getproject?projectID=46
Editor's Notes
Tesco: Manchester - Sustainable Consumption Institute has daily update of Tesco’s club card data - detailed purchase logs of all club card users
Nectar: direct and indirect collection
In a way ‘Sociology’ gets cut out of the loop
Industrial not as in the study of industry but as an industry in itself. Knowing Capitalism
Software & social stratification
- will the ability to be much smarter about neighborhood data lead to an increase in local homogeneity as similar people cluster?
Industrial not as in the study of industry but as an industry in itself. Knowing Capitalism
Software & social stratification
- will the ability to be much smarter about neighborhood data lead to an increase in local homogeneity as similar people cluster?
To introduce the data…
To introduce the data…
To introduce the data…
To introduce the data…
To introduce the data…
To introduce the data…
Inspiration… the idea is that diversity/heterogeneity in communication networks should enable greater economic opportunity through ‘weak ties’ (Granovetter/Burt etc).
Over-localised networks seen as limited
Eagle et al used a complex measure of entropy and structural holes -> high diversity has negative correlation with deprivation
To introduce the data…
Same story
dominance of local calls
% local calls increases as deprivation increases
Instead of aggregating to LSOA level, this is household level
Same story
dominance of local calls
% local calls increases as deprivation increases
Same story
dominance of local calls
% local calls increases as deprivation increases
Same story
dominance of local calls
% local calls increases as deprivation increases
% local calls increases as deprivation increases
Same story
dominance of local calls
% local calls increases as deprivation increases