Data on Polarization, Peace, and PropagandaIngmar Weber
Slide deck used during a presentation at STI Forum side event on "Innovating for Peace", hosted by the UN missions of Turkey and Qatar, as well as UN DPPA. More about the STI Forum at https://www.un.org/ecosoc/en/events/2021/multi-stakeholder-forum-science-technology-and-innovation-sustainable-development-goals. The presentation features work by QCRI scientists, including Muhammad Imran and Preslav Nakov, and many others. See the last slide for references.
Hate Speech, Polarization and Online DataIngmar Weber
Slides for keynote talk at workshop on hate speech detection and genocide/politicide prediction organized by Ben Goldsmith (https://researchers.anu.edu.au/researchers/goldsmith-b) and Marian-Andrei Rizoiu (https://cecs.anu.edu.au/people/marian-andrei-rizoiu) at the Australian National University (ANU) on November 26/27, 2018.
Monitoring migration using social media data an introductionIngmar Weber
Webinar hosted by Georgetown Global Human Development Program. Recording of the session is available at https://georgetown.zoom.us/rec/play/vMElIrir-mg3HYWWtwSDVP4rW461Jqis2iAZ8qUEyErnBiQGNFqlb7tEM-ofDv5GgHLYljjfYoBR0852?continueMode=true&_x_zm_rtaid=8UtYz48MSICS36P_gsSTDg.1588250263912.a1e6098b19d99104d94a3a1063c22f70&_x_zm_rhtaid=503
Original eventbrite at https://www.eventbrite.com/e/monitoring-migration-using-social-media-data-an-introduction-tickets-102687830064
Tapping into advertising platforms to monitor ict usage and moreIngmar Weber
Examples for how to use publicly available, aggregate and anonymous data from online advertising platforms to monitor (i) digital gender gaps, (ii) international migration, and (iii) income inequalities. Details at https://ingmarweber.de/publications/. Joint work with Ridhi Kashyap, Masoomali Fatehkia, Joao Palotti, Emilio Zagheni and others.
Using Advertising Platforms for Social GoodIngmar Weber
Presentation given as part of RMIT's Data Analytics of Social Impact event on December 1. More details about the work at https://ingmarweber.de/publications/. The work presented is led by the Qatar Computing Research Institute but involves many other collaborators and stakeholders.
Presentation given as part of a panel on "Innovative Applications of AI & Data Science" on Tuesday, May 28, 2019, as part of the ITU AI for Good Global Summit. See https://ingmarweber.de/publications/ for additional details about the work.
Presentation given during panel on "Closing inequalities and gender divides" at UNESCO Mobile Learning Week on March 28. Program at https://en.unesco.org/sites/default/files/unesco-mlw2018-programme-en.pdf#page=11. References at https://ingmarweber.de/publications/. Coming soon: http://www.digitalgendergaps.org/. Joint work with Ridhi Kashyap and Masoomali Fatehkia.
Slides for presentation given at Immigration, Refugees and Citizenship Canada (IRCC) in Ottawa as part of the "Research Matters" series on Sep 25. Joint work with Emilio Zagheni, Kiran Garimella, Joao Palotti and others. See https://ingmarweber.de/publications/ for publications and citation information. The trip to Ottawa is supported in part by ACM's Distinguished Speakers Program (https://speakers.acm.org/speakers/weber_7123).
Data on Polarization, Peace, and PropagandaIngmar Weber
Slide deck used during a presentation at STI Forum side event on "Innovating for Peace", hosted by the UN missions of Turkey and Qatar, as well as UN DPPA. More about the STI Forum at https://www.un.org/ecosoc/en/events/2021/multi-stakeholder-forum-science-technology-and-innovation-sustainable-development-goals. The presentation features work by QCRI scientists, including Muhammad Imran and Preslav Nakov, and many others. See the last slide for references.
Hate Speech, Polarization and Online DataIngmar Weber
Slides for keynote talk at workshop on hate speech detection and genocide/politicide prediction organized by Ben Goldsmith (https://researchers.anu.edu.au/researchers/goldsmith-b) and Marian-Andrei Rizoiu (https://cecs.anu.edu.au/people/marian-andrei-rizoiu) at the Australian National University (ANU) on November 26/27, 2018.
Monitoring migration using social media data an introductionIngmar Weber
Webinar hosted by Georgetown Global Human Development Program. Recording of the session is available at https://georgetown.zoom.us/rec/play/vMElIrir-mg3HYWWtwSDVP4rW461Jqis2iAZ8qUEyErnBiQGNFqlb7tEM-ofDv5GgHLYljjfYoBR0852?continueMode=true&_x_zm_rtaid=8UtYz48MSICS36P_gsSTDg.1588250263912.a1e6098b19d99104d94a3a1063c22f70&_x_zm_rhtaid=503
Original eventbrite at https://www.eventbrite.com/e/monitoring-migration-using-social-media-data-an-introduction-tickets-102687830064
Tapping into advertising platforms to monitor ict usage and moreIngmar Weber
Examples for how to use publicly available, aggregate and anonymous data from online advertising platforms to monitor (i) digital gender gaps, (ii) international migration, and (iii) income inequalities. Details at https://ingmarweber.de/publications/. Joint work with Ridhi Kashyap, Masoomali Fatehkia, Joao Palotti, Emilio Zagheni and others.
Using Advertising Platforms for Social GoodIngmar Weber
Presentation given as part of RMIT's Data Analytics of Social Impact event on December 1. More details about the work at https://ingmarweber.de/publications/. The work presented is led by the Qatar Computing Research Institute but involves many other collaborators and stakeholders.
Presentation given as part of a panel on "Innovative Applications of AI & Data Science" on Tuesday, May 28, 2019, as part of the ITU AI for Good Global Summit. See https://ingmarweber.de/publications/ for additional details about the work.
Presentation given during panel on "Closing inequalities and gender divides" at UNESCO Mobile Learning Week on March 28. Program at https://en.unesco.org/sites/default/files/unesco-mlw2018-programme-en.pdf#page=11. References at https://ingmarweber.de/publications/. Coming soon: http://www.digitalgendergaps.org/. Joint work with Ridhi Kashyap and Masoomali Fatehkia.
Slides for presentation given at Immigration, Refugees and Citizenship Canada (IRCC) in Ottawa as part of the "Research Matters" series on Sep 25. Joint work with Emilio Zagheni, Kiran Garimella, Joao Palotti and others. See https://ingmarweber.de/publications/ for publications and citation information. The trip to Ottawa is supported in part by ACM's Distinguished Speakers Program (https://speakers.acm.org/speakers/weber_7123).
Digital Gender Gaps Seen Through Social MediaIngmar Weber
Keynote given on September 25 at the 5th Annual International Conference on Data Science and Business Analytics (ICDSBA2021, http://www.icdsbaconference.com/2021/menu/keynotes). Based on joint work with Ridhi Kashyap (University of Oxford), Masoomali Fatehkia (Qatar Computing Research Institute), and others. Includes preliminary observations from changes in Afghanistan since the Taliban take-over. Details about published work at https://ingmarweber.de/publications/.
Biases in Social Media Research (NoBias EU project)Miriam Fernandez
Biases that emerge in Social Media Research. Talk presented at the NoBias EU project. Inspired by Olteanou et al. Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries (2019)
This is an invited talk I presented at the University of Zurich, speakers' series 2.10.2017. The presentation is based on the following paper: Brandtzaeg, P. B., & Følstad, A. (2017). Trust and distrust in online fact-checking services. Communications of the ACM. 60(9): 65-71
Presentation at NJ GMIS Conference, April 11, 2013. (Template is copyrighted. If you want a PDF copy, contact us via MorrisCountyNJ.gov feedback form.)
Research seminar Queen Mary University of London (CogSci)Miriam Fernandez
Research Seminar at Queen Mary University of London (CogSci) 2nd December 2020. In this talk, we present and discuss various research and development projects focused on addressing some of the societal challenges of today’s world (misinformation spreading, extremism, child grooming) by means of social data science. These problems are complex, dynamic and heterogeneous, and cannot be looked at from a single lens. We will discuss how these problems are addressed from a multidisciplinary angle, combining theories, models and methods from social science, computer science, or psychology; bringing a deeper understanding of the problems, and their relations to users and their behaviours, to the proposed solutions.
How does fakenews spread understanding pathways of disinformation spread thro...Araz Taeihagh
What are the pathways for spreading disinformation on social media platforms? This article addresses this question by collecting, categorising, and situating an extensive body of research on how application programming interfaces (APIs) provided by social media platforms facilitate the spread of disinformation. We first examine the landscape of official social media APIs, then perform quantitative research on the open-source code repositories GitHub and GitLab to understand the usage patterns of these APIs. By inspecting the code repositories, we classify developers' usage of the APIs as official and unofficial, and further develop a four-stage framework characterising pathways for spreading disinformation on social media platforms. We further highlight how the stages in the framework were activated during the 2016 US Presidential Elections, before providing policy recommendations for issues relating to access to APIs, algorithmic content, advertisements, and suggest rapid response to coordinate campaigns, development of collaborative, and participatory approaches as well as government stewardship in the regulation of social media platforms.
Chung-Jui LAI - Polarization of Political Opinion by News MediaREVULN
In 2016 US election, social media played a vital role in shaping public opinions as expressed by the news media that have created the phenomenon of polarization in the United States. Because social media gave people the ability to follow, share, post, comment below everything, the phenomenon of political opinions being spread easily and quickly on social media by the news agencies is bringing out a significantly polarized populace.
Consequently, it’s very important to understand the language differences on Twitter and figure out how propaganda spread by different political parties that influence or perhaps mislead public opinion. This talk will introduce the relationship among the social media, public opinion, and news media, then suggests the method to collect the tweets from Twitter and conduct sentimental and logistic regression analysis on them. Furthermore, this talk points out the special aspect on the relationship between the polarization and the topic of this conference (fake news, disinformation and propaganda).
Main points:
- situation in Taiwan
- research on fake news
- methods for fighting fake news
Social Media and You (for tweeners/teens)Anne Arendt
Made for Girls Retreat on Wisdom Conference (http://www.uvu.edu/wsc/grow/) held June 14, 2012. Target audience is tweeners and early teens. The idea is not to discourage social media but to explain the role it plays and encourage wise decisions.
Manichean Progress: Positive and Negative States of the Art in Web-Scale Data...Lewis Shepherd
Discussion of current Microsoft Research projects and prospects which help drive open innovation and agile experimentation via cloud-based services; and projects which aim at advancing the state-of-the-art in knowledge representation and reasoning under uncertainty at web scale. I also begin by discussing potential malign implications of mass automated implementations of linked-data systems, as functions of what governments (and users of public data) can/should/shouldn’t do in promoting mass activity.
Technology Use of College Students: An Exploratory StudyJoseph Stabb, ABD
3/2007
“Technology Use of College Students: An Exploratory Study”
Published by the Department of Communication, Rochester Institute of Technology
Stabb, J. (2007). Technology use of college students: an
exploratory study. Converent for Undergraduate Research in Communication, 177-185.
GitHub as Transparency Device in Data Journalism, Open Data and Data ActivismLiliana Bounegru
Slides from presentation of research agenda around uses of GitHub in journalism at the Digital Methods Summer School 2015. More details here: http://lilianabounegru.org/2015/07/08/github-as-transparency-device-in-data-journalism-open-data-and-data-activism/
Online Misinformation: Challenges and Future DirectionsMiriam Fernandez
Paper: http://oro.open.ac.uk/53734/
Misinformation has become a common part of our digital media
environments and it is compromising the ability of our societies to
form informed opinions. It generates misperceptions, which have
affected the decision making processes in many domains, including economy, health, environment, and elections, among others.
Misinformation and its generation, propagation, impact, and management is being studied through a variety of lenses (computer science, social science, journalism, psychology, etc.) since it widely affects multiple aspects of society. In this paper we analyse the phenomenon of misinformation from a technological point of view. We study the current socio-technical advancements towards addressing the problem, identify some of the key limitations of current technologies, and propose some ideas to target such limitations.
The goal of this position paper is to reflect on the current state
of the art and to stimulate discussions on the future design and
development of algorithms, methodologies, and applications
Doing Digital Methods: Some Recent Highlights from Winter and Summer SchoolsLiliana Bounegru
Talk given at the Digital Methods Winter School 2017 at the University of Amsterdam. It presents a selection of projects developed at the 2016 Digital Methods Winter and Summer Schools (www.digitalmethods.net).
Digital Trace Data for Demographic ResearchIngmar Weber
Lecture given as part of the BIGSSS 2019 summer school on migration (https://bigsss-css.jacobs-university.de/migration2019/migration/). See https://ingmarweber.de/publications/ for related publications. Mostly joint work with Emilio Zagheni.
Digital Gender Gaps Seen Through Social MediaIngmar Weber
Keynote given on September 25 at the 5th Annual International Conference on Data Science and Business Analytics (ICDSBA2021, http://www.icdsbaconference.com/2021/menu/keynotes). Based on joint work with Ridhi Kashyap (University of Oxford), Masoomali Fatehkia (Qatar Computing Research Institute), and others. Includes preliminary observations from changes in Afghanistan since the Taliban take-over. Details about published work at https://ingmarweber.de/publications/.
Biases in Social Media Research (NoBias EU project)Miriam Fernandez
Biases that emerge in Social Media Research. Talk presented at the NoBias EU project. Inspired by Olteanou et al. Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries (2019)
This is an invited talk I presented at the University of Zurich, speakers' series 2.10.2017. The presentation is based on the following paper: Brandtzaeg, P. B., & Følstad, A. (2017). Trust and distrust in online fact-checking services. Communications of the ACM. 60(9): 65-71
Presentation at NJ GMIS Conference, April 11, 2013. (Template is copyrighted. If you want a PDF copy, contact us via MorrisCountyNJ.gov feedback form.)
Research seminar Queen Mary University of London (CogSci)Miriam Fernandez
Research Seminar at Queen Mary University of London (CogSci) 2nd December 2020. In this talk, we present and discuss various research and development projects focused on addressing some of the societal challenges of today’s world (misinformation spreading, extremism, child grooming) by means of social data science. These problems are complex, dynamic and heterogeneous, and cannot be looked at from a single lens. We will discuss how these problems are addressed from a multidisciplinary angle, combining theories, models and methods from social science, computer science, or psychology; bringing a deeper understanding of the problems, and their relations to users and their behaviours, to the proposed solutions.
How does fakenews spread understanding pathways of disinformation spread thro...Araz Taeihagh
What are the pathways for spreading disinformation on social media platforms? This article addresses this question by collecting, categorising, and situating an extensive body of research on how application programming interfaces (APIs) provided by social media platforms facilitate the spread of disinformation. We first examine the landscape of official social media APIs, then perform quantitative research on the open-source code repositories GitHub and GitLab to understand the usage patterns of these APIs. By inspecting the code repositories, we classify developers' usage of the APIs as official and unofficial, and further develop a four-stage framework characterising pathways for spreading disinformation on social media platforms. We further highlight how the stages in the framework were activated during the 2016 US Presidential Elections, before providing policy recommendations for issues relating to access to APIs, algorithmic content, advertisements, and suggest rapid response to coordinate campaigns, development of collaborative, and participatory approaches as well as government stewardship in the regulation of social media platforms.
Chung-Jui LAI - Polarization of Political Opinion by News MediaREVULN
In 2016 US election, social media played a vital role in shaping public opinions as expressed by the news media that have created the phenomenon of polarization in the United States. Because social media gave people the ability to follow, share, post, comment below everything, the phenomenon of political opinions being spread easily and quickly on social media by the news agencies is bringing out a significantly polarized populace.
Consequently, it’s very important to understand the language differences on Twitter and figure out how propaganda spread by different political parties that influence or perhaps mislead public opinion. This talk will introduce the relationship among the social media, public opinion, and news media, then suggests the method to collect the tweets from Twitter and conduct sentimental and logistic regression analysis on them. Furthermore, this talk points out the special aspect on the relationship between the polarization and the topic of this conference (fake news, disinformation and propaganda).
Main points:
- situation in Taiwan
- research on fake news
- methods for fighting fake news
Social Media and You (for tweeners/teens)Anne Arendt
Made for Girls Retreat on Wisdom Conference (http://www.uvu.edu/wsc/grow/) held June 14, 2012. Target audience is tweeners and early teens. The idea is not to discourage social media but to explain the role it plays and encourage wise decisions.
Manichean Progress: Positive and Negative States of the Art in Web-Scale Data...Lewis Shepherd
Discussion of current Microsoft Research projects and prospects which help drive open innovation and agile experimentation via cloud-based services; and projects which aim at advancing the state-of-the-art in knowledge representation and reasoning under uncertainty at web scale. I also begin by discussing potential malign implications of mass automated implementations of linked-data systems, as functions of what governments (and users of public data) can/should/shouldn’t do in promoting mass activity.
Technology Use of College Students: An Exploratory StudyJoseph Stabb, ABD
3/2007
“Technology Use of College Students: An Exploratory Study”
Published by the Department of Communication, Rochester Institute of Technology
Stabb, J. (2007). Technology use of college students: an
exploratory study. Converent for Undergraduate Research in Communication, 177-185.
GitHub as Transparency Device in Data Journalism, Open Data and Data ActivismLiliana Bounegru
Slides from presentation of research agenda around uses of GitHub in journalism at the Digital Methods Summer School 2015. More details here: http://lilianabounegru.org/2015/07/08/github-as-transparency-device-in-data-journalism-open-data-and-data-activism/
Online Misinformation: Challenges and Future DirectionsMiriam Fernandez
Paper: http://oro.open.ac.uk/53734/
Misinformation has become a common part of our digital media
environments and it is compromising the ability of our societies to
form informed opinions. It generates misperceptions, which have
affected the decision making processes in many domains, including economy, health, environment, and elections, among others.
Misinformation and its generation, propagation, impact, and management is being studied through a variety of lenses (computer science, social science, journalism, psychology, etc.) since it widely affects multiple aspects of society. In this paper we analyse the phenomenon of misinformation from a technological point of view. We study the current socio-technical advancements towards addressing the problem, identify some of the key limitations of current technologies, and propose some ideas to target such limitations.
The goal of this position paper is to reflect on the current state
of the art and to stimulate discussions on the future design and
development of algorithms, methodologies, and applications
Doing Digital Methods: Some Recent Highlights from Winter and Summer SchoolsLiliana Bounegru
Talk given at the Digital Methods Winter School 2017 at the University of Amsterdam. It presents a selection of projects developed at the 2016 Digital Methods Winter and Summer Schools (www.digitalmethods.net).
Digital Trace Data for Demographic ResearchIngmar Weber
Lecture given as part of the BIGSSS 2019 summer school on migration (https://bigsss-css.jacobs-university.de/migration2019/migration/). See https://ingmarweber.de/publications/ for related publications. Mostly joint work with Emilio Zagheni.
Engaging with Patients Online: The do’s and don’t’s, and what’s to gainKatja Reuter, PhD
These slides were presented at the the Annual Meeting of the American College of Rheumatology (ACR) and Association of Rheumatology Health Professionals (ARHP) on Nov 15, 2016 in Washington DC. The presentation highlights ways in which physician-scientists may reach and engage patients online for different purposes such as health promotion, study recruitment, attracting patients, and reputation building. The presentation also touches upon tracking online activities for performance reviews and responding to negative reviews.
This webinar will demonstrate how to use HealthyCity.org to enhance your grant proposals and reports with visually impactful and relevant data and maps. Learn how to access data highlighting the needs and opportunities within your communities and how to make the case that your program will make a difference.
Digital & Social Media: 3-step action plan for clinical trials volunteer recr...Aimee Edgeworth
This presentation provides guidance on using social media for volunteer recruitment to research studies, including a case example from ResearchMatch. Presented in the Clinical Research Center of Vanderbilt University Medical Center for continued education credit and shared on the CTSA's Trial Innovation Network Recruitment Toolkit.
During the COVID-19 Global Pandemic, there were multiple lessons provided to the world. In this talk, I set the stage for the discussion, highlight the issues we faced (and still face), I speak to an effort that contributed to help address one of those issues, then speak to future challenges and our responsibilities going forward.
Healthy City works with community-based organizations to apply Community Based Participatory Action Research (CBPAR) in their mapping and community-engagement work. CBPAR starts with issues and strategies to produce analysis, uses mapping technology as one tool for community engagement and focuses on communities within a geographic location, such as a neighborhood. Using CPBAR in mapping facilitates engagement, education, strategizing, and dialogue among community members--including youth--and decision-makers.
Including young people in map making allows them to contribute their unique knowledge and lived experiences as community residents. Youth can provide invaluable insight and can act as change agents advocating on behalf of their communities. Whether you are a Youth Organizer, Community Liaison or Direct Service Provider, there are a number of ways you can incorporate and share youth data and stories using a variety of free resources and tools available on HealthyCity.org to build community power.
In this webinar you will learn how to:
1) Research and map youth population data to enhance program focus and planning on healthycity.org
2) Upload your own data onto a map
3) Use Wikimaps to better plan, collaborate and share youth outreach strategies and stories
Social media analysis for better policy makingIngmar Weber
Slides for my presentation during the "Social Media and the Law" Symposium on Wednesday, October 18 (http://www.hbku.edu.qa/en/academic-events/colloquium-social-media-and-law). See https://ingmarweber.de/publications/ for a full list of my publications in computational social science.
Similar to Digital Demography - Keynote at SocInfo'18 (20)
Different Hashtags, Different Opinions - Twitter Polarization in EgyptIngmar Weber
Slides for a flash talk given on July 12 at the online workshop on "Modeling and Measuring Social Cohesion using Methods of Computational Social Science" organized by the Bremen International Graduate School of Social Sciences (BIGSSS). Joint work with Kiran Garimella, Alaa Bateyneh, and others. Details at https://ingmarweber.de/publications/.
Not so-obvious social media analysis to study current affairsIngmar Weber
This instalment of the (un)data Seminar Series on Outrageous Questions will discuss topics related to studying political extremism and monitoring global development through social media analysis. Dr. Weber will explain the methods and analysis behind three case studies of political analysis and global development: first, how one can use Twitter to understand the antecedents of ISIS support by building a classifier that, in retrospect, predicts if a Twitter user will oppose or support ISIS. The features that are predictive (or not) of ISIS support help to understand potential motivations. Second, a methodology for monitoring political polarization and show how increases in this polarization measure tended to precede outbreaks in Egypt. Third, how publicly accessible advertising data from Facebook can de repurposed to monitor migration and track internet access gender gaps around the globe. The overarching goal is to illustrate how despite challenges around lack of representativeness, social media can provide useful signals to study current affairs.
Digital advertising data for migration researchIngmar Weber
Presentation given at the International Metropolis Conference on June 25 in Ottawa. The conference trip was financially supported by the European Commission's Joint Research Center as well as by ACM's Distinguished Speakers Program.
Using advertising data to model migration, poverty and digital gender gapsIngmar Weber
Talk given at the Machine Learning and Data Analytics Symposium (MLDAS 2019). https://qcai.qcri.org/index.php/events/mldas-2019/.
Contact me if you're interested in the topic of poverty mapping or data for development in general.
Correlated Impulses: Using Facebook Interests to Improve Predictions of Crime...Ingmar Weber
Presentation at the Annual Meeting of the Population Association of America (PAA 2019) in the Session Using Social Media in Population Research (http://paa2019.populationassociation.org/sessions/128). See https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0211350 for the full paper.
Estimating Migration and Quantifying Migrant Assimilation Using Internet Adve...Ingmar Weber
Presentation given at HBKU Colloquium on Big Data and the Law (https://hbku.edu.qa/en/academic-events/digital-humanities-big-data-law). Joint research with Emilio Zagheni and others on monitoring international migration using Facebook advertising data.
Using internet advertising data for studying international migrationIngmar Weber
Slides used at #BigData4Migration workshop. https://ec.europa.eu/jrc/en/event/workshop/big-data-and-alternative-data-sources-migration-case-studies-policy-support
See https://ingmarweber.de/publications/ for some related publications.
Matching Methods and Natural Experiments - Examples of Causal Inference from ...Ingmar Weber
Invited talk given at Observational Studies Through Social Media workshop (OSSM, https://www.microsoft.com/en-us/research/event/ossm17/) at ICWSM'17. Includes both my own but, mostly, other people's work.
Studies covered:
"Detecting Emotional Contagion in Massive Social Networks", http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0090315
"Exercise contagion in a global social network", https://www.nature.com/articles/ncomms14753
"How Community Feedback Shapes User Behavior", https://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8066
"A Warm Welcome Matters!: The Link Between Social Feedback and Weight Loss in /r/loseit", http://dl.acm.org/citation.cfm?id=3055131
See https://ingmarweber.de/publications/ for my own work.
Not-so-obvious Online Data Sources for Demographic ResearchIngmar Weber
Slides from ICWSM'17 workshop on Social Media for Demographic Research (https://sites.google.com/site/smdrworkshop/program). Data sets include Facebook's ad audience estimates, Google Correlate, online genealogy and much more. Contact Ingmar directly to learn more.
A Warm Welcome Matters! The Link Between Social Feedback and Weight Loss in /...Ingmar Weber
Presentation in the Web Science track at WWW'17. Full paper https://ingmarweber.de/wp-content/uploads/2017/04/A-Warm-Welcome-Matters-The-Link-Between-Social-Feedback-and-Weight-Loss-in-r-loseit.pdf. Work led by Tiago Cunha (https://twitter.com/tocunha).
Abstract of paper:
Social feedback has long been recognized as an important
element of successful health-related behavior change. However, most of the existing studies look at the effect that online social feedback has. This paper fills gaps in the literature by proposing a framework to study the causal effect
that receiving social support in the form of comments in an
online weight loss community has on (i) the probability of
the user to return to the forum, and, more importantly, on
(ii) the weight loss reported by the user. Using a matching
approach for causal inference we observe a difference of 9
lbs lost between users who do or do not receive comments.
Surprisingly, this effect is mediated by neither an increase in
lifetime in the community nor by an increased activity level
of the user. Our results show the importance that a "warm
welcome" has when using online support forums to achieve
health outcomes.
Social Media Research and Practice in the Health Domain - Tutorial, Part IIIngmar Weber
Second part of tutorial given at Weill Cornell Medicine Qatar on February 18, 2017 (https://qatar-weill.cornell.edu/bchp/socialMediaResearchPracticeHealthDomain.html). First part given by Luis Luque (see https://www.slideshare.net/luis.luque/social-media-research-in-the-health-domain-tutorial).
Digital Demography - WWW'17 Tutorial - Part IIIngmar Weber
Second part of a tutorial given at WWW'17 (http://www2017.com.au/) on Digital Demography. More information about the tutorial at https://sites.google.com/site/digitaldemography/. Please reference the archival tutorial description (at http://papers.www2017.com.au.s3-website-ap-southeast-2.amazonaws.com/companion/p935.pdf) when using the material.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofsAlex Pruden
This paper presents Reef, a system for generating publicly verifiable succinct non-interactive zero-knowledge proofs that a committed document matches or does not match a regular expression. We describe applications such as proving the strength of passwords, the provenance of email despite redactions, the validity of oblivious DNS queries, and the existence of mutations in DNA. Reef supports the Perl Compatible Regular Expression syntax, including wildcards, alternation, ranges, capture groups, Kleene star, negations, and lookarounds. Reef introduces a new type of automata, Skipping Alternating Finite Automata (SAFA), that skips irrelevant parts of a document when producing proofs without undermining soundness, and instantiates SAFA with a lookup argument. Our experimental evaluation confirms that Reef can generate proofs for documents with 32M characters; the proofs are small and cheap to verify (under a second).
Paper: https://eprint.iacr.org/2023/1886
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
GridMate - End to end testing is a critical piece to ensure quality and avoid...ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
2. About ACM
ACM, the Association for Computing Machinery (www.acm.org), is the
premier global community of computing professionals and students
with nearly 100,000 members in more than 170 countries interacting
with more than 2 million computing professionals worldwide.
OUR MISSION: We help computing professionals to be their best and
most creative. We connect them to their peers, to what the latest
developments, and inspire them to advance the profession and make a
positive impact on society.
OUR VISION: We see a world where computing helps solve tomorrow’s
problems – where we use our knowledge and skills to advance the
computing profession and make a positive social impact throughout the
world.
3. The Distinguished Speakers Program
is made possible by
For additional information, please visit http://dsp.acm.org/
5. What is Demography?
Demography is the statistical study of populations.
According to IP address 70.67.193.176, user Pbsouthwood and other
contributors to https://en.wikipedia.org/wiki/Demography
6. The Population Equation
Change in population = Inputs – Outputs
Inputs = Births + In-migration
Outputs = Deaths + Out-Migration
• ∆P = (B + I) − (D + O)
Fertility, Mortality and Migration
7. Quant: How much? Where? When?
• Births
- Birth registry: India: ~75%, Kenya: ~65%, Liberia: ~25% (2017)
• Deaths
- “Global Burden of Disease” (Murray and Lopez, 1997):
“Medically certified information is available for less than 30% of
the estimated 50.5 million deaths that occur each year
worldwide.”
• Migration
- “The size of the irregular migrant stock of the EU-27 in 2008
was measured to be between 1.9 and 3.8 million, a decline from
between 2.4 and 5.4 million in the EU-25 in 2005” (Kovacheva
and Vogel, 2009).
8. Qual: Why? How?
• Births
- Effect of religiosity, available childcare, …
• Deaths
- Ikigai: “reason to get up in the morning”
• Migration
- Push/pull factors, assimilation, …
9. Opportunities for New methods
• Filling data gaps
– New data on migration, fertility, employment, …
• Explaining behavior
– Richer data, including networks and long-term history
• Predictive modeling
– Multi-modal forecasting
• Take a global perspective on things
– Facebook, Google, satellites know (almost) no borders
Goal is to augment, not replace, traditional approaches
Big Data is not a cure-all panacea
10. Rest of the Talk: Data-Centric
• Online advertising audience estimates
- Migration stocks, migrant assimilation
- Male mean-age-at-childbirth
- Ethics, limitations and challenges
• More non-obvious data sources
- Google Correlate, Followerwonk
- Even more non-obvious data sources
• Thoughts on interdisciplinary work
21. Bias Reduction via Model-Fitting
Mean out-of-sample absolute percentage error 37%,
down from 56% without origin-age bias correction
Adjusted R^2 = .70
Does not use GDP, language, internet penetration, …
z = age-gender group
i = country of birth
j = US state of residence
23. Do Refugees Share German Interests?
What interests to consider? Everybody likes “Music” and “Technology”.
How to interpret the score? High/low compared to European migrants?
Germans in DEU
FB Interests:
Football (90%)
Max Planck (70%)
Sauerkraut (40%)
…
Arabs in MENA
FB Interests:
Quran (80%)
Ibn Al-Haytham (60%)
Falafel (60%)
…
Arabs in DEU
FB Interests:
?
24. Obtaining an Assimilation Score
Migrant Group Assim. Score
Austrian migrants .900
Spanish migrants .864
French migrants .803
Turkish-speaking migrants .746
Arabic-speaking migrants .643
A: Women, non-uni, 45-64 .461
A: Men, uni, 18-24 .677
• Experimental methodology: take with a ton, not just a grain of salt
• Needs to be validated externally
• Goals include finding “bridging” interests/patterns
32. Ethical Challenges
• Privacy
– Was possible to obtain PII until early 2018 [Venkatadri
et al., 2018]
– Audience estimates for “custom audiences” no longer
supported
– The k in k-anonymity has been increased
• Vulnerable populations
– Was possible to exclude minorities from ads
– Was possible to target based on likely diseases
– Still targetable through proxy interests
We only use aggregate, anonymous data without
interacting with any user
33. Limitations: Selection Bias
Aren’t you just studying FB/LI/… vs. the “real
world”?
• If we understand the selection bias, we can
model it and de-bias the estimates
– Non-response biases in surveys
– Usual signal in a prediction model
– Non-random fake/duplicate accounts could
become problematic depending on domain
• Even if “only” LI, still real world implications
– LI used for hiring and to find keynote speakers
34. Limitations: Black Box
Who knows how FB’s classifier labels “expats” or
SC’s classifier labels “math enthusiasts”?
• Use as signal, not as ground truth
– Empirically, highly predictive of “proper” definition
– Unified definition can be a plus
• Incentives are in the right place
– Companies try to provide values to advertisers and,
hence, are incentivized to have correct labels
• Inconsistencies over time problematic
– Especially for “interests” large temporal fluctuations
35. Limitations: No Longitudinal Data
None of the services provide information on
running a hypothetical ad campaign in the past
• No historical data sets of audience estimates
exist
– Hard to do causal inference (natural experiments)
• Similar to Twitter streaming API
– The best time to start collecting data is 20 years
ago. The second best time is today.
36. Limitations: What about Myspace?
Services come and go and FB et al. might
become obsolete
• Only useful for understanding and modeling
processes with current relevance
Usage patterns change over time
• FB of 2008 unlike FB of 2018.
• Users might become more privacy concerned.
• Re-validate and re-train your model over time.
38. Google Correlate and Fertility
Discover search terms correlated with different fertility rates across US
states
https://www.google.com/trends/correlate/search?e=id:f7PU4mFDWV-
&t=all
Remove terms with no conceivable link to sex, pregnancy or maternity
39. Predicting Spatial Variability
• Performance of the regression models using
leave-one-out cross-validation. SMAPE is in [%], RMSE
values are multiplied by 1,000.
Use the previous terms to build
models predicting state-level fertility
rates
All these models make predictions
based on linear combinations of
search intensity
Goal: apply these spatial models
across time
40. Learning Across Space, Predicting Across
Time
• Temporal trend when applying the “teen” model
across time. Values are rescaled to a maximum of 1.0.
Pearson r correlation across 2010-2015 when using
the spatial model to predict trends across time.
41. Followerwonk and Gender Roles
(mother|mom) of … (father|dad) of …
… (girls|daughters) 1,257 303 1,560
… (sons|boys) 941 545 1,486
2,198 848
Location: (us|usa|united states)
https://followerwonk.com/bio/?q=(father|dad)%20of%20(sons|boys)&l=(us|usa|united%20states)
42. More Creative Data Sources
Online genealogy
- see how marriage mobility has changed
Online obituaries
- monitor patients discharged from hospital
Google Street View
- parked cars tell income and political orientation
https://sites.google.com/site/digitaldemography/
47. Thanks!
Interested in a collaboration? Get in touch:
iweber@hbku.edu.qa
References and full texts: https://ingmarweber.de/publications/
48. About ACM
ACM, the Association for Computing Machinery (www.acm.org), is the
premier global community of computing professionals and students
with nearly 100,000 members in more than 170 countries interacting
with more than 2 million computing professionals worldwide.
OUR MISSION: We help computing professionals to be their best and
most creative. We connect them to their peers, to what the latest
developments, and inspire them to advance the profession and make a
positive impact on society.
OUR VISION: We see a world where computing helps solve tomorrow’s
problems – where we use our knowledge and skills to advance the
computing profession and make a positive social impact throughout the
world.
49. The Distinguished Speakers Program
is made possible by
For additional information, please visit http://dsp.acm.org/