SlideShare a Scribd company logo
1 of 25
OFFICIALASONAM 7-10 Dec 2020
Derek Weber1,2 & Frank Neumann1
Contact: derek.weber@adelaide.edu.au
1 School of Computer Science, University of Adelaide, Australia.
2 Defence Science and Technology Group, Department of Defence, Australia.
WHO’S IN THE GANG?
REVEALING COORDINATING COMMUNITIES IN SOCIALMEDIA
OFFICIALASONAM 7-10 Dec 2020 2
https://twitter.com/conspirator0/status/1328479128908132358 17 Nov 2020
OFFICIALASONAM 7-10 Dec 2020
Context
• Social media for political communication
• Targeted marketing → (Political) Spam & recruitment
• Anonymity → Trolls
• Automation → Bots, social bots & political bots
3
Targeted marketing + Anonymity + Automation = Interference
OFFICIALASONAM 7-10 Dec 2020
Coordination
4
Malone & Crowston (1994)
T1 T2
T1
T2
R1
T1
T2
Starbird et al. (2019)
1 https://www.nbcnews.com/politics/national-security/russian-trolls-who-interfered-2016-u-s-election-also-made-n1013811
2 Bianoconi (2015) DOI: 10.1209/0295-5075/111/56001
3 https://commons.wikimedia.org/wiki/File:Social_media_icon.png
3
CultivatedOrchestrated
1
Emergent
2
OFFICIALASONAM 7-10 Dec 2020
Information Campaigns & Coordination Strategies
5
Intent Strategy Planning Execution
Post Repost
time
time
Hostile
Friendly
Good Post Junk Post
time
Channel: e.g., #OurPartyRocks
time
t0 t1
t1 t2
t2 t3
Pollution
Woolley (2016)
Fisher (2018)
Nasim et al. (2018)
Boost
Cao et al. (2015)
Vo et al. (2017)
Graham et al. (2020)
Bully
Hine et al. (2017)
Kumar et al. (2018)
OFFICIALASONAM 7-10 Dec 2020
The Challenge
• Discovery
• RQ1 How can highly coordinating communities (HCCs) be found?
• Validation
• RQ2 How do the discovered communities differ?
• RQ3 How consistent is the HCC messaging?
• RQ4 Are the HCCs internally or externally focused?
6
To identify groups of accounts whose behaviour,
though typical in nature, is anomalous in degree.
OFFICIALASONAM 7-10 Dec 2020
Approach
7
(1) (2) (3) (4) (5)
@ #
,@,t1
,#,t2
,#,t3
,@,t4
,#,t5
Common
behaviours
Timed
Interactions
Evidence of
coordination
Highly
Coordinating
Communities
Latent
Coordination
Network
https://github.com/weberdc/find_hccs
OFFICIALASONAM 7-10 Dec 2020
Temporal Aspect
Time →
γ
8
https://github.com/weberdc/find_hccs
(1) (2) (3) (4) (5)
@ #
,@,t1
,#,t2
,#,t3
,@,t4
,#,t5
γ γ
OFFICIALASONAM 7-10 Dec 2020
Extract HCCs
9
Focal Structures Analysis1 – Variant (FSA_V)
https://github.com/weberdc/find_hccs
1 Şen et al., 2016
OFFICIALASONAM 7-10 Dec 2020
Extract HCCs
10
Focal Structures Analysis1 – Variant (FSA_V)
https://github.com/weberdc/find_hccs
1 Şen et al., 2016
OFFICIALASONAM 7-10 Dec 2020
Evaluation
• Window size, γ = {15, 60, 360, 1440} minutes
• Community extraction:
• FSA_V, θ = 0.3
• K Nearest Neighour (kNN), k = ln(|V|) (cf. Cao et al., 2015)
• Threshold
• Coordination strategy
• Boost (co-retweet)
• Pollute (co-hashtag)
• Bully (co-mention)
11
OFFICIALASONAM 7-10 Dec 2020
Data
DS1 – Australian regional election, 2018
• Including ground truth (GT, cf. Keller et al., 2017)
DS2 – Twitter’s election integrity dataset1
• Internet Research Agency, 2016 tweets
12
1 https://about.twitter.com/en us/values/elections-integrity.html
Tweets
(T)
Retweets (RT) Accts
(A)
Days T / A /
Day
RT / A /
Day
DS1 115.9k 64.2k 54.5% 20.6k 18 0.31 0.17
- GT 4.2k 2.5k 59.7% 134 18 1.74 1.04
DS2 1.57m 729.9k 56.6% 1.4k 365 3.12 1.45
Ethics
University of Adelaide
HREC H-2018-045https://github.com/weberdc/find_hccs
OFFICIALASONAM 7-10 Dec 2020
Finding HCCs
• Coordination Strategies
• HCCs found in all
• Many components (HCCs), incl. a very large one
• kNN – single HCC with internal structure
13
DS1 DS2
FSA_V kNN Threshold FSA_V kNN Threshold
GT
Networks: Gephi https://gephi.org
OFFICIALASONAM 7-10 Dec 2020
Membership
• Window size, γ
• Jaccard
• Yellow = Identical
• Blue = Disjoint
14
GT
DS1
DS2
OFFICIALASONAM 7-10 Dec 2020
Hashtags
15
Retweeting the same tweet
Retweeting the same account
GT DS1
DS2
OFFICIALASONAM 7-10 Dec 2020
Consistency
Hypothesis
• Dissemination groups should have
highly similar content
• i.e., Int. similarity ≥ Ext. similarity
Approach
• For each group:
• For each member:
• Combine member tweets into a corpus
• Compare 5-char n-grams of corpus
against all other accounts
• Plot similiarities as a matrix
cf. a heatmap
16
GT DS1
DS2 RANDOM
OFFICIALASONAM 7-10 Dec 2020
Connectivity & Focus
17
(log) (log)
Retweets Mentions
OFFICIALASONAM 7-10 Dec 2020 18
DNC
RNC
1 https://botometer.osome.iu.edu/
2 Rizoiu et al.(2018) 2
2
Botometer1
Recent Work:
2020 US Conventions
OFFICIALASONAM 7-10 Dec 2020 19
19
Diagrams: visone https://visone.info
Co-hashtag bigraphs
DNC RNC
OFFICIALASONAM 7-10 Dec 2020
Future
• Community extraction
• Stitching (cf. FSA)
• Conductance cut clustering
• Combinations
• Statistics
• Parameter choices
• Anomaly validation
• Temporal analysis
• Evolution of HCCs
• Edge semantics
20
OT1
10s
10s
A B C
w1 w2
OT1 OT2
10s 10s
A B C
w1 w2
ts(w2) >> ts(w1 )
OFFICIALASONAM 7-10 Dec 2020
Conclusion
• Targeted Marketing + Anonymity + Automation = Interference
• Focus: detecting coordinating groups
• Discovery:
• RQ1: Pipeline model + community extraction (FSA_V)
• Validation
• RQ2: Distinct content patterns
• RQ3: Internal consistency
• RQ4: External focus
• Recent & future work
• More detail: arXiv:2010.08180
• Code: https://github.com/weberdc/find_hccs
21
OFFICIALASONAM 7-10 Dec 2020 22
OFFICIALASONAM 7-10 Dec 2020
Literature
Campaign detection
• Content (Lee et al., 2013)
• URL sharing (Cao et al., 2015)
• Temporal signatures (Hine et al., 2017)
• Cross-platform linking (Starbird & Wilson, 2020)
Social bots
• Agenda-oriented automated accounts pretending to be human (Ferrara et al., 2016)
• Hard to identify (Cresci et al., 2017; Nasim et al., 2018; Grimme et al., 2018)
Coordination as “orchestrated activities”
• Focus on detecting strategies (Fisher, 2018; Grimme et al., 2018; Starbird et al., 2019;
Weber, 2019)
• Co-retweet (Weber, 2019; Graham et al., 2020)
• Co-hashtag (Woolley, 2016; Fisher, 2018)
• Co-URL (Cao et al., 2015; Giglietto et al., 2020)
23
OFFICIALASONAM 7-10 Dec 2020
• Brooking, E. T., and Singer, P. W. (2016). War Goes Viral: How social media is being weaponized across the world. The Atlantic. Retrieved
from https://www.theatlantic.com/magazine/archive/2016/11/war-goes-viral/501125/
• Cao, C., Caverlee, J., Lee, K., Ge, H. and Chung, J. 2015. Organic or Organized?: Exploring URL Sharing Behavior. CIKM’15, 513-522.
• Cresci, S., Pietro, R. D., Petrocchi, M., Spognardi, A. and Tesconi, M. 2017. The Paradigm-Shift of Social Spambots. WWW’17 (Companion
Volume), 963-972.
• Ferrara, E., Varol, O., Davis, C., Menczer, F. and Flammini, A. 2016. The rise of social bots. Communications of the ACM. 59(7) (Jun. 2016),
96–104. DOI:10.1145/2818717.
• Fisher, A. 2018. Netwar in Cyberia: Decoding the Media Mujahidin. USC Centre on Public Diplomacy, Figueroa Press.
• Giglietto, F., Righetti, N., Rossi, L. and Marino, G. 2020. Coordinated Link Sharing Behavior as a Signal to Surface Sources of Problematic
Information on Facebook. SMSociety, 85-91.
• Grimme, C., Assenmacher, D. and Adam, L. 2018. Changing Perspectives: Is It Sufficient to Detect Social Bots? HCI (13) 2018, 445–461.
• Graham, T., Bruns, A., Zhu, G., and Campbell, R. 2020. Like a virus: The coordinated spread of coronavirus disinformation. Centre for
Responsible Technology, The Australia Institute.
• Hine, G. E., Onaolapo, J., Cristofaro, E. D., Kourtellis, N., Leontiadis, I., Samaras, R., Stringhini, G. and Blackburn, J. 2017. Kek, Cucks, and
God Emperor Trump: A Measurement Study of 4chan’s Politically Incorrect Forum and Its Effects on the Web. ICWSM’17, 92–101.
• Keller, F.B., Schoch, D., Stier, S. and Yang, J.H. 2017. How to Manipulate Social Media: Analyzing Political Astroturfing Using Ground Truth
Data from South Korea. ICWSM’17, 564–567
• Kumar, S., Hamilton, W.L., Leskovec, J. and Jurafsky, D. 2018. Community Interaction and Conflict on the Web. Proceedings of the 2018
World Wide Web Conference, WWW’18, 933–943 .
24
References (1)
OFFICIALASONAM 7-10 Dec 2020
References (2)
• Lee, K., Caverlee, J., Cheng, Z. and Sui, D. Z. 2013. Campaign extraction from social media. ACM Transactions on Intelligent Systems and
Technology. 5(1), 9:1–9:28. DOI:10.1145/2542182.2542191.
• Lim, K. H., Jayasekara, S., Karunasekera, S., Harwood, A., Falzon, L., Dunn, J. and Burgess, G. 2019. RAPID: Real-time Analytics Platform
for Interactive Data Mining. KCML/PKDD (3) 2018. 649–653.
• Nasim, M., Nguyen, A., Lothian, N., Cope, R. and Mitchell, L. 2018. Real-time Detection of Content Polluters in Partially Observable Twitter
Networks. WWW’18 (Companion Volume), 1331-1339.
• Pacheco, D., Hui, P.-M., Torres-Lugo, C., Truong, B. T., Flammini, A. and Menczer, F. 2020-01-16. Uncovering Coordinated Networks on
Social Media. ICWSM’21, to appear.
• Rizoiu, M.-A., Graham, T., Zhang, R., Zhang, Y., Ackland, R. and Xie, L. 2018. #DebateNight: The Role and Influence of Socialbots on
Twitter During the 1st 2016 U.S. Presidential Debate. ICWSM’18, 300–309.
• Saulwick, A., & Trentelman, K. (2014). Towards a formal semantics of social influence. Knowledge-Based Systems, 71, 52–60.
DOI:10.1016/j.knosys.2014.06.022
• Şen, F., Wigand, R., Agarwal, N., Tokdemir, S., and Kasprzyk, R. 2016. Focal structures analysis: identifying influential sets of individuals in
a social network. Social Network Analysis and Mining, 6(1). DOI:10.1007/s13278-016-0319-z
• Starbird, K. and Wilson, T. 2020. Cross-Platform Disinformation Campaigns: Lessons Learned and Next Steps. Harvard Kennedy School
Misinformation Review. (Jan. 2020). DOI:10.37016/mr-2020-002.
• Starbird, K., Arif, A. and Wilson, T. 2019. Disinformation as Collaborative Work:: Surfacing the Participatory Nature of Strategic Information
Operations . Proc. ACM on Human-Computer Interaction. 3 (CSCW), 127:1–127:26. DOI:10.1145/3359229.
• Vo, N., Lee, K., Cao, C., Tran, T. and Choi, H. 2017. Revealing and detecting malicious retweeter groups. ASONAM’17, 363-368.
• Weber, D. 2019. On Coordinated Online Behaviour. Poster presented at ASNAC’19, Adelaide, Australia.
25

More Related Content

What's hot

Neso nuffic presentation in Seoul
Neso nuffic presentation in SeoulNeso nuffic presentation in Seoul
Neso nuffic presentation in Seoul
Maurice Vergeer
 
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job ApplicantsYou're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
Toronto Metropolitan University
 

What's hot (20)

GitHub as Transparency Device in Data Journalism, Open Data and Data Activism
GitHub as Transparency Device in  Data Journalism, Open Data and Data ActivismGitHub as Transparency Device in  Data Journalism, Open Data and Data Activism
GitHub as Transparency Device in Data Journalism, Open Data and Data Activism
 
Mapping Issues with the Web: An Introduction to Digital Methods
Mapping Issues with the Web: An Introduction to Digital MethodsMapping Issues with the Web: An Introduction to Digital Methods
Mapping Issues with the Web: An Introduction to Digital Methods
 
Studying Social Science Using E Tools
Studying Social Science Using E ToolsStudying Social Science Using E Tools
Studying Social Science Using E Tools
 
Neso nuffic presentation in Seoul
Neso nuffic presentation in SeoulNeso nuffic presentation in Seoul
Neso nuffic presentation in Seoul
 
How to get started with Data Journalism
How to get started with Data JournalismHow to get started with Data Journalism
How to get started with Data Journalism
 
Doing Digital Methods: Some Recent Highlights from Winter and Summer Schools
Doing Digital Methods: Some Recent Highlights from Winter and Summer SchoolsDoing Digital Methods: Some Recent Highlights from Winter and Summer Schools
Doing Digital Methods: Some Recent Highlights from Winter and Summer Schools
 
Individual project 2.20
Individual project 2.20Individual project 2.20
Individual project 2.20
 
Social Media in Australia: A ‘Big Data’ Perspective on Twitter
Social Media in Australia: A ‘Big Data’ Perspective on TwitterSocial Media in Australia: A ‘Big Data’ Perspective on Twitter
Social Media in Australia: A ‘Big Data’ Perspective on Twitter
 
Designing Cybersecurity Policies with Field Experiments
Designing Cybersecurity Policies with Field ExperimentsDesigning Cybersecurity Policies with Field Experiments
Designing Cybersecurity Policies with Field Experiments
 
Social Media Mining and Analytics
Social Media Mining and AnalyticsSocial Media Mining and Analytics
Social Media Mining and Analytics
 
Predicting Elections with Twitter
Predicting Elections with TwitterPredicting Elections with Twitter
Predicting Elections with Twitter
 
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
 
Dynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
Dynamics of a Scandal: The Centrelink Robodebt Affair on TwitterDynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
Dynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
 
Echo Chamber? What Echo Chamber? Reviewing the Evidence
Echo Chamber? What Echo Chamber? Reviewing the EvidenceEcho Chamber? What Echo Chamber? Reviewing the Evidence
Echo Chamber? What Echo Chamber? Reviewing the Evidence
 
Data augmented ethnography: 
using big data and ethnography to explore candi...
Data augmented ethnography: 
using big data and ethnography  to explore candi...Data augmented ethnography: 
using big data and ethnography  to explore candi...
Data augmented ethnography: 
using big data and ethnography to explore candi...
 
Matching Mobile Applications for Cross Promotion
Matching Mobile Applications for Cross PromotionMatching Mobile Applications for Cross Promotion
Matching Mobile Applications for Cross Promotion
 
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
 
The data we want
The data we wantThe data we want
The data we want
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis
 
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job ApplicantsYou're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
 

Similar to Who’s in the Gang? Revealing Coordinating Communities in Social Media

Revealing Social Bots with Coordinated Networks during US Political Conventions
Revealing Social Bots with Coordinated Networks during US Political ConventionsRevealing Social Bots with Coordinated Networks during US Political Conventions
Revealing Social Bots with Coordinated Networks during US Political Conventions
Derek Weber
 
Classifying Twitter Content
Classifying Twitter ContentClassifying Twitter Content
Classifying Twitter Content
Stephen Dann
 
Critically Assembling Data, Processes & Things: Toward and Open Smart City
Critically Assembling Data, Processes & Things: Toward and Open Smart CityCritically Assembling Data, Processes & Things: Toward and Open Smart City
Critically Assembling Data, Processes & Things: Toward and Open Smart City
Communication and Media Studies, Carleton University
 
GIS-Pro September 2010 findings
GIS-Pro September  2010 findingsGIS-Pro September  2010 findings
GIS-Pro September 2010 findings
KSI Koniag
 
Gis pro sept 2010 findings
Gis pro sept  2010 findingsGis pro sept  2010 findings
Gis pro sept 2010 findings
KSI Koniag
 

Similar to Who’s in the Gang? Revealing Coordinating Communities in Social Media (20)

Revealing Social Bots with Coordinated Networks during US Political Conventions
Revealing Social Bots with Coordinated Networks during US Political ConventionsRevealing Social Bots with Coordinated Networks during US Political Conventions
Revealing Social Bots with Coordinated Networks during US Political Conventions
 
Classifying Twitter Content
Classifying Twitter ContentClassifying Twitter Content
Classifying Twitter Content
 
Critically Assembling Data, Processes & Things: Toward and Open Smart City
Critically Assembling Data, Processes & Things: Toward and Open Smart CityCritically Assembling Data, Processes & Things: Toward and Open Smart City
Critically Assembling Data, Processes & Things: Toward and Open Smart City
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
 
Automating Homelessness
Automating HomelessnessAutomating Homelessness
Automating Homelessness
 
Who are We Studying: Humans or Bots?
Who are We Studying: Humans or Bots? Who are We Studying: Humans or Bots?
Who are We Studying: Humans or Bots?
 
Big data divided (24 march2014)
Big data divided (24 march2014)Big data divided (24 march2014)
Big data divided (24 march2014)
 
Comprehensive Viewpoint Representations for a Deeper Understanding of User In...
Comprehensive Viewpoint Representations for a Deeper Understanding of User In...Comprehensive Viewpoint Representations for a Deeper Understanding of User In...
Comprehensive Viewpoint Representations for a Deeper Understanding of User In...
 
Strategic perspectives 3
Strategic perspectives 3Strategic perspectives 3
Strategic perspectives 3
 
COMS2200 Big data & Society Week 2 Crowdsourcing
COMS2200 Big data & Society Week 2 CrowdsourcingCOMS2200 Big data & Society Week 2 Crowdsourcing
COMS2200 Big data & Society Week 2 Crowdsourcing
 
Streaming Graph Processing and Analytics
Streaming Graph Processing and AnalyticsStreaming Graph Processing and Analytics
Streaming Graph Processing and Analytics
 
Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?
 
Linas Eriksonas, The Impact of Time Zone Difference on Social Networks of En...
Linas Eriksonas, The Impact of Time Zone Difference on Social Networks of En...Linas Eriksonas, The Impact of Time Zone Difference on Social Networks of En...
Linas Eriksonas, The Impact of Time Zone Difference on Social Networks of En...
 
GIS-Pro September 2010 findings
GIS-Pro September  2010 findingsGIS-Pro September  2010 findings
GIS-Pro September 2010 findings
 
Crowdsourcing: A Geographic Approach to Identifying Policy Opportunities and ...
Crowdsourcing: A Geographic Approach to Identifying Policy Opportunities and ...Crowdsourcing: A Geographic Approach to Identifying Policy Opportunities and ...
Crowdsourcing: A Geographic Approach to Identifying Policy Opportunities and ...
 
Data science innovations
Data science innovations Data science innovations
Data science innovations
 
From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?
 
Participatory public data infrastructure: open data standards and the turn to...
Participatory public data infrastructure: open data standards and the turn to...Participatory public data infrastructure: open data standards and the turn to...
Participatory public data infrastructure: open data standards and the turn to...
 
The Challenges of Integrating Mapping and Texting for Community Development i...
The Challenges of Integrating Mapping and Texting for Community Development i...The Challenges of Integrating Mapping and Texting for Community Development i...
The Challenges of Integrating Mapping and Texting for Community Development i...
 
Gis pro sept 2010 findings
Gis pro sept  2010 findingsGis pro sept  2010 findings
Gis pro sept 2010 findings
 

Recently uploaded

Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 

Recently uploaded (20)

Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 

Who’s in the Gang? Revealing Coordinating Communities in Social Media

  • 1. OFFICIALASONAM 7-10 Dec 2020 Derek Weber1,2 & Frank Neumann1 Contact: derek.weber@adelaide.edu.au 1 School of Computer Science, University of Adelaide, Australia. 2 Defence Science and Technology Group, Department of Defence, Australia. WHO’S IN THE GANG? REVEALING COORDINATING COMMUNITIES IN SOCIALMEDIA
  • 2. OFFICIALASONAM 7-10 Dec 2020 2 https://twitter.com/conspirator0/status/1328479128908132358 17 Nov 2020
  • 3. OFFICIALASONAM 7-10 Dec 2020 Context • Social media for political communication • Targeted marketing → (Political) Spam & recruitment • Anonymity → Trolls • Automation → Bots, social bots & political bots 3 Targeted marketing + Anonymity + Automation = Interference
  • 4. OFFICIALASONAM 7-10 Dec 2020 Coordination 4 Malone & Crowston (1994) T1 T2 T1 T2 R1 T1 T2 Starbird et al. (2019) 1 https://www.nbcnews.com/politics/national-security/russian-trolls-who-interfered-2016-u-s-election-also-made-n1013811 2 Bianoconi (2015) DOI: 10.1209/0295-5075/111/56001 3 https://commons.wikimedia.org/wiki/File:Social_media_icon.png 3 CultivatedOrchestrated 1 Emergent 2
  • 5. OFFICIALASONAM 7-10 Dec 2020 Information Campaigns & Coordination Strategies 5 Intent Strategy Planning Execution Post Repost time time Hostile Friendly Good Post Junk Post time Channel: e.g., #OurPartyRocks time t0 t1 t1 t2 t2 t3 Pollution Woolley (2016) Fisher (2018) Nasim et al. (2018) Boost Cao et al. (2015) Vo et al. (2017) Graham et al. (2020) Bully Hine et al. (2017) Kumar et al. (2018)
  • 6. OFFICIALASONAM 7-10 Dec 2020 The Challenge • Discovery • RQ1 How can highly coordinating communities (HCCs) be found? • Validation • RQ2 How do the discovered communities differ? • RQ3 How consistent is the HCC messaging? • RQ4 Are the HCCs internally or externally focused? 6 To identify groups of accounts whose behaviour, though typical in nature, is anomalous in degree.
  • 7. OFFICIALASONAM 7-10 Dec 2020 Approach 7 (1) (2) (3) (4) (5) @ # ,@,t1 ,#,t2 ,#,t3 ,@,t4 ,#,t5 Common behaviours Timed Interactions Evidence of coordination Highly Coordinating Communities Latent Coordination Network https://github.com/weberdc/find_hccs
  • 8. OFFICIALASONAM 7-10 Dec 2020 Temporal Aspect Time → γ 8 https://github.com/weberdc/find_hccs (1) (2) (3) (4) (5) @ # ,@,t1 ,#,t2 ,#,t3 ,@,t4 ,#,t5 γ γ
  • 9. OFFICIALASONAM 7-10 Dec 2020 Extract HCCs 9 Focal Structures Analysis1 – Variant (FSA_V) https://github.com/weberdc/find_hccs 1 Şen et al., 2016
  • 10. OFFICIALASONAM 7-10 Dec 2020 Extract HCCs 10 Focal Structures Analysis1 – Variant (FSA_V) https://github.com/weberdc/find_hccs 1 Şen et al., 2016
  • 11. OFFICIALASONAM 7-10 Dec 2020 Evaluation • Window size, γ = {15, 60, 360, 1440} minutes • Community extraction: • FSA_V, θ = 0.3 • K Nearest Neighour (kNN), k = ln(|V|) (cf. Cao et al., 2015) • Threshold • Coordination strategy • Boost (co-retweet) • Pollute (co-hashtag) • Bully (co-mention) 11
  • 12. OFFICIALASONAM 7-10 Dec 2020 Data DS1 – Australian regional election, 2018 • Including ground truth (GT, cf. Keller et al., 2017) DS2 – Twitter’s election integrity dataset1 • Internet Research Agency, 2016 tweets 12 1 https://about.twitter.com/en us/values/elections-integrity.html Tweets (T) Retweets (RT) Accts (A) Days T / A / Day RT / A / Day DS1 115.9k 64.2k 54.5% 20.6k 18 0.31 0.17 - GT 4.2k 2.5k 59.7% 134 18 1.74 1.04 DS2 1.57m 729.9k 56.6% 1.4k 365 3.12 1.45 Ethics University of Adelaide HREC H-2018-045https://github.com/weberdc/find_hccs
  • 13. OFFICIALASONAM 7-10 Dec 2020 Finding HCCs • Coordination Strategies • HCCs found in all • Many components (HCCs), incl. a very large one • kNN – single HCC with internal structure 13 DS1 DS2 FSA_V kNN Threshold FSA_V kNN Threshold GT Networks: Gephi https://gephi.org
  • 14. OFFICIALASONAM 7-10 Dec 2020 Membership • Window size, γ • Jaccard • Yellow = Identical • Blue = Disjoint 14 GT DS1 DS2
  • 15. OFFICIALASONAM 7-10 Dec 2020 Hashtags 15 Retweeting the same tweet Retweeting the same account GT DS1 DS2
  • 16. OFFICIALASONAM 7-10 Dec 2020 Consistency Hypothesis • Dissemination groups should have highly similar content • i.e., Int. similarity ≥ Ext. similarity Approach • For each group: • For each member: • Combine member tweets into a corpus • Compare 5-char n-grams of corpus against all other accounts • Plot similiarities as a matrix cf. a heatmap 16 GT DS1 DS2 RANDOM
  • 17. OFFICIALASONAM 7-10 Dec 2020 Connectivity & Focus 17 (log) (log) Retweets Mentions
  • 18. OFFICIALASONAM 7-10 Dec 2020 18 DNC RNC 1 https://botometer.osome.iu.edu/ 2 Rizoiu et al.(2018) 2 2 Botometer1 Recent Work: 2020 US Conventions
  • 19. OFFICIALASONAM 7-10 Dec 2020 19 19 Diagrams: visone https://visone.info Co-hashtag bigraphs DNC RNC
  • 20. OFFICIALASONAM 7-10 Dec 2020 Future • Community extraction • Stitching (cf. FSA) • Conductance cut clustering • Combinations • Statistics • Parameter choices • Anomaly validation • Temporal analysis • Evolution of HCCs • Edge semantics 20 OT1 10s 10s A B C w1 w2 OT1 OT2 10s 10s A B C w1 w2 ts(w2) >> ts(w1 )
  • 21. OFFICIALASONAM 7-10 Dec 2020 Conclusion • Targeted Marketing + Anonymity + Automation = Interference • Focus: detecting coordinating groups • Discovery: • RQ1: Pipeline model + community extraction (FSA_V) • Validation • RQ2: Distinct content patterns • RQ3: Internal consistency • RQ4: External focus • Recent & future work • More detail: arXiv:2010.08180 • Code: https://github.com/weberdc/find_hccs 21
  • 23. OFFICIALASONAM 7-10 Dec 2020 Literature Campaign detection • Content (Lee et al., 2013) • URL sharing (Cao et al., 2015) • Temporal signatures (Hine et al., 2017) • Cross-platform linking (Starbird & Wilson, 2020) Social bots • Agenda-oriented automated accounts pretending to be human (Ferrara et al., 2016) • Hard to identify (Cresci et al., 2017; Nasim et al., 2018; Grimme et al., 2018) Coordination as “orchestrated activities” • Focus on detecting strategies (Fisher, 2018; Grimme et al., 2018; Starbird et al., 2019; Weber, 2019) • Co-retweet (Weber, 2019; Graham et al., 2020) • Co-hashtag (Woolley, 2016; Fisher, 2018) • Co-URL (Cao et al., 2015; Giglietto et al., 2020) 23
  • 24. OFFICIALASONAM 7-10 Dec 2020 • Brooking, E. T., and Singer, P. W. (2016). War Goes Viral: How social media is being weaponized across the world. The Atlantic. Retrieved from https://www.theatlantic.com/magazine/archive/2016/11/war-goes-viral/501125/ • Cao, C., Caverlee, J., Lee, K., Ge, H. and Chung, J. 2015. Organic or Organized?: Exploring URL Sharing Behavior. CIKM’15, 513-522. • Cresci, S., Pietro, R. D., Petrocchi, M., Spognardi, A. and Tesconi, M. 2017. The Paradigm-Shift of Social Spambots. WWW’17 (Companion Volume), 963-972. • Ferrara, E., Varol, O., Davis, C., Menczer, F. and Flammini, A. 2016. The rise of social bots. Communications of the ACM. 59(7) (Jun. 2016), 96–104. DOI:10.1145/2818717. • Fisher, A. 2018. Netwar in Cyberia: Decoding the Media Mujahidin. USC Centre on Public Diplomacy, Figueroa Press. • Giglietto, F., Righetti, N., Rossi, L. and Marino, G. 2020. Coordinated Link Sharing Behavior as a Signal to Surface Sources of Problematic Information on Facebook. SMSociety, 85-91. • Grimme, C., Assenmacher, D. and Adam, L. 2018. Changing Perspectives: Is It Sufficient to Detect Social Bots? HCI (13) 2018, 445–461. • Graham, T., Bruns, A., Zhu, G., and Campbell, R. 2020. Like a virus: The coordinated spread of coronavirus disinformation. Centre for Responsible Technology, The Australia Institute. • Hine, G. E., Onaolapo, J., Cristofaro, E. D., Kourtellis, N., Leontiadis, I., Samaras, R., Stringhini, G. and Blackburn, J. 2017. Kek, Cucks, and God Emperor Trump: A Measurement Study of 4chan’s Politically Incorrect Forum and Its Effects on the Web. ICWSM’17, 92–101. • Keller, F.B., Schoch, D., Stier, S. and Yang, J.H. 2017. How to Manipulate Social Media: Analyzing Political Astroturfing Using Ground Truth Data from South Korea. ICWSM’17, 564–567 • Kumar, S., Hamilton, W.L., Leskovec, J. and Jurafsky, D. 2018. Community Interaction and Conflict on the Web. Proceedings of the 2018 World Wide Web Conference, WWW’18, 933–943 . 24 References (1)
  • 25. OFFICIALASONAM 7-10 Dec 2020 References (2) • Lee, K., Caverlee, J., Cheng, Z. and Sui, D. Z. 2013. Campaign extraction from social media. ACM Transactions on Intelligent Systems and Technology. 5(1), 9:1–9:28. DOI:10.1145/2542182.2542191. • Lim, K. H., Jayasekara, S., Karunasekera, S., Harwood, A., Falzon, L., Dunn, J. and Burgess, G. 2019. RAPID: Real-time Analytics Platform for Interactive Data Mining. KCML/PKDD (3) 2018. 649–653. • Nasim, M., Nguyen, A., Lothian, N., Cope, R. and Mitchell, L. 2018. Real-time Detection of Content Polluters in Partially Observable Twitter Networks. WWW’18 (Companion Volume), 1331-1339. • Pacheco, D., Hui, P.-M., Torres-Lugo, C., Truong, B. T., Flammini, A. and Menczer, F. 2020-01-16. Uncovering Coordinated Networks on Social Media. ICWSM’21, to appear. • Rizoiu, M.-A., Graham, T., Zhang, R., Zhang, Y., Ackland, R. and Xie, L. 2018. #DebateNight: The Role and Influence of Socialbots on Twitter During the 1st 2016 U.S. Presidential Debate. ICWSM’18, 300–309. • Saulwick, A., & Trentelman, K. (2014). Towards a formal semantics of social influence. Knowledge-Based Systems, 71, 52–60. DOI:10.1016/j.knosys.2014.06.022 • Şen, F., Wigand, R., Agarwal, N., Tokdemir, S., and Kasprzyk, R. 2016. Focal structures analysis: identifying influential sets of individuals in a social network. Social Network Analysis and Mining, 6(1). DOI:10.1007/s13278-016-0319-z • Starbird, K. and Wilson, T. 2020. Cross-Platform Disinformation Campaigns: Lessons Learned and Next Steps. Harvard Kennedy School Misinformation Review. (Jan. 2020). DOI:10.37016/mr-2020-002. • Starbird, K., Arif, A. and Wilson, T. 2019. Disinformation as Collaborative Work:: Surfacing the Participatory Nature of Strategic Information Operations . Proc. ACM on Human-Computer Interaction. 3 (CSCW), 127:1–127:26. DOI:10.1145/3359229. • Vo, N., Lee, K., Cao, C., Tran, T. and Choi, H. 2017. Revealing and detecting malicious retweeter groups. ASONAM’17, 363-368. • Weber, D. 2019. On Coordinated Online Behaviour. Poster presented at ASNAC’19, Adelaide, Australia. 25

Editor's Notes

  1. In mid-November there was a sudden spate of almost identical tweets posted, starting with “My wife just told me that she voted for Joe Biden. As we speak we are getting divorced and I’m leaving for” somewhere, “Papers signed everything is done.. Absolutely disgusted with the 2020 elections, what a disgrace”. This collection was posted by a data scientist with an interest in propaganda on social media. As you can see, the language is almost, but not quite, identical, and based on other analyses, these are not bots. An earlier instance of this “copypasta” related to the sale of a football club, and a huge number of accounts (https://twitter.com/conspirator0/status/1299127612804075523) posted pretty much the same message, which, in the scheme of things, is relatively harmless. This example, however, could have effects such as reinforcing the idea of rejecting election results, which hurts democratic systems. This kind of activity could be regarded as an information campaign, especially if it was seeded or supported by a foreign adversary, so it’s important to be able to identify such campaigns. As these accounts aren’t likely to be bots, bot detection systems won’t be as much help, unless they’re used for retweeting. What is, however, of particular interest, is finding the groups of accounts who are working together to do this, and to see how they’re coordinating their activities. Cf. “Swarmcast” p.10 from Fisher, A. 2018. “Netwar in Cyberia: Decoding the Media Mjahidin”, USC Center on Public Diplomacy, Paper 5.
  2. Online marketing used to be just spam, now it’s political advertising Anonymity is great for giving the disenfranchised a voice, but it also enables trolling, and I suspect Twitter’s latest ‘Fleets’ feature will make this worse. Automation is great for news aggregators and sports announcement bots, but it allows accounts to post vast amounts of polarising, biased information, mis- and disinformation. It’s now much easier for nation states to interfere with each other online discussion, particular political discussion.
  3. When I refer to coordination, there’s a spectrum. Fundamentally, as described by Malone and Crowston in the 90s, it’s the alignment of dependencies between tasks and the resources they use. At higher levels are Starbird et al’s descriptions of information campaigns being orchestrated (run from on high), cultivated (e.g., infiltrating existing issue-motivated groups), or emergent (like in conspiracy communities). In between is the space where specific communication actions occur. In our case, we’re interested in social media communications, which have many parallels, so methods to detecting reposting may apply to Twitter retweets, Facebook shares, or Tumblr Reposts.
  4. Intent Convince population to lose weight Strategy Ad blitz + word of mouth + tax incentives (Planning) Talk to ad company, design/disseminate flyers, craft legislation Execution TV ads, social media ads, flyers in fast food shops, pass laws & enforce These are some coordination strategies observed in the literature, … This is not an exhaustive list, of course, and coordination of activity may take many guises, but we’re focusing on co-actions, where a pair of accounts do the same thing to achieve their goal.
  5. To find the highly coordinating communities, or HCCs, we first [CLICK] Extract the abstracted common interaction behaviours from social media posts of a variety of platforms; [CLICK] Then create a multi-digraph of interactions between users, hashtags and URLs. [CLICK] This interaction graph is mined for evidence of coordination based on a search criterion, which is general and could be quite domain-specific. Examples of these include amplify-by-repost which is deliberate dissemination through content sharing, channel pollution through posting to particular hashtags or other communities, and coordinated attacks on a single user or community. [CLICK] A latent coordination network is constructed from this evidence, being a weighted undirected network of users, [CLICK] And this is then mined for the most highly coordinating communities. That covers how to search for HCCs in social media data, but we need to consider its temporal aspect. [CLICK] (shrink) [CLICK] (next slide)
  6. Given a timeline of social media posts, we segment them into windows of gamma minutes. We can vary the window depending on the nature of coordination sought.
  7. Lots of variables, so I’ll mostly focus on 15m windows, FSA_V and Boost via co-retweet. Threshold = retain heaviest normalised edges above 0.1
  8. Clearly different groups are found. The kNN results consist of a single large component, but I’ve used Gephi’s Force Atlas and then Fruchter Rheingold layouts to identify internal structures, and then applied the Louvain method to identify the clusters by colour. I’ve done this for visualisation purposes, but have not analysed the networks any more deeply yet. [CLICK] The final HCCs are the ones I discovered in the ground truth – each component consists of accounts from a different political party.
  9. How similar is the membership between the HCCs discovered in different window sizes? Quite a lot of variation.
  10. Different HCCs have clearly different content – looking at their hashtags, we can get a feel for what their interests are.
  11. HCC sizes: GT: 3,3,3 DS1: 18,17,14,14,12,7,7,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5… DS2: 65,5,4,3,3,3,3,3,3,3,2,2,2,2,2,2,2,2,2 RANDOM: 18,17,14,14,12,7,7,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,…
  12. Internal vs External focus Internal retweet ratio – how often do they retweet themselves? Internal mention ratio – how often do they mention themselves? Remember that the members of HCCs need not be directly connected – all their connections may be inferred.
  13. Looked at Twitter data over the recent 2020 US Democratic and Republican Conventions back in August. Using co-retweet and a 10 second window, I identified a number of communities, all of which Botometer tells us are highly bot-like, and many of which present themselves as normal people, but with greatly inflated tweeting rates. Named accounts are organisational or have been deleted.
  14. Co-hashtag with 10 second window to find the HCCs, then added the hashtags they use back in, to see which HCCs are associated. Structures tells us something about behaviour: Clusters around a few hashtags says many groups are, in fact, one Isolated stars tell us the accounts are pushing an agenda (the content of the hashtags) and no one else here is interested Fans are pushing an agenda but have connected with other communities here via the linking hashtags This tells me the community extraction method that I used (FSA_V) could do with a bit of tweaking and perhaps communities could be stitched back together.
  15. Future: Statistical measures Evolution of HCCs Simulation of coordination strategies
  16. Campaign detection spawned out of spam detection but has relied on a number of features over the years. Automation detection is particular important for social bots, accounts that masquerade as real humans, but are, in fact, automated. It’s so hard to tell bots and humans apart that the real question is more about how they work together to achieve goals – how do they orchestrate their activities? For example, by disseminating content by retweeting the same tweets, or URLs, or using the same hashtags.