SlideShare a Scribd company logo
What makes a bot a bot? Exploring benign
automation on Twitter
Dr. Felix Victor Münch1
; Ben Thies1
, B.A.; Dr. Cornelius Puschmann1
; Dr. Axel Bruns2
1
Leibniz Institute for Media Research | Hans-Bredow-Institut (HBI), Germany
2
Digital Media Research Centre (DMRC), Queensland University of Technology (QUT), Australia
http://bit.ly/whatabot
Background
Background
● Most Twitter-related bot research usually focuses on activity by examining
Tweets, Hashtags, Keywords, @-mentions, etc.*
● Most research focuses on malign automation, sometimes even assumes
that most automation is malign, and often cannot estimate its
impact
This research is focused on the Twitter follow network, takes a nuanced
approach to automation and assesses bot impact via the Twitter follow
network
*
https://www.pewinternet.org/2018/04/09/bots-in-the-twittersphere/
Research Goals
● provide an overview of bot
prevalence, influence, and
roles of automated accounts in
the German-speaking
Twittersphere
● develop and test
tools/methods that enable
such analyses for other
languages as well
● do some groundwork for a
theory of social
automation by cataloguing
suspected bots
Method
1. Create a language-based (German) Twitter follow network sample of most
influential accounts (method preprint: https://bit.ly/twitterwalk)
2. Use Botometer to estimate automation probabilities and apply probability
threshold of 0.75 to mark a suspected bot
3. Manually check 100 suspected bots prioritised by Page Rank and code them
inductively
Step 1: Drawing a Sample
Our adaptation of the ‘rank degree’ method
More details in our preprint: http://bit.ly/twitterwalk
Bottom: Original graph without walked edges. Starting nodes (seeds) are drawn randomly (1) and
walker move to their friend with the highest in-degree (2-6). Walked edges get removed/‘burned’.
Top: Current sample at each step. Walked (and symmetric) edges are added to sample.
1 2 3 4 5 6
Coverage
Distribution of public accounts with > 1 friend in the test sample over the percentage of their friends that
can be found in the influencer sample (left, filtered for in-degree >= 1, leaving 199,180 accounts) / baseline
sample (right, same size, randomly drawn from German accounts in global dataset)
(A Sample of) The German Twittersphere
● ~ 1 Mio Accounts, 1.6
Mio. Edges
● 6 Months of collection
using API authentication
keys of 10 collaborators
● Figure: 3-core of sample
network, coloured by
communities detected
with infomap. Node size
indicates Page Rank
Keyword extraction with chi-squared criterion
active
accounts keywords top accounts tag
2015
e3, stream, xd, e32019, nintendo, twitch, game, crossing, pc, zelda, animal, gameplay, cyberpunk2077,
games, switch, xbox, trailer, cyberpunk, gaming, xboxe3, uff, nice, awesome, keanu, pk, live, nen, lol,
mega
unge, dagibee, Gronkh, MelinaSophie, LeFloid,
iBlali, Taddl, rewinside, HandIOfIBlood, PietSmiet
YouTubers &
Gaming
1855
berlin, innen (female suffix), spd (German party), berliner, study, companies, discuss, cdu (party),
demand, topics, important, federal government, german, digitisation, topic, has been, annefrank,
climate protection, more, june, shows, germany, interview, politics, brandenburg
tazgezwitscher, Die_Gruenen, Tagesspiegel,
c_lindner, gutjahr, dunjahayali, sigmargabriel,
sixtus, HeikoMaas, spdde German politics
1414
women’s strike, switzerland, swiss people, glarner, svp (Swiss party), bern, women’s strike2019, zurich,
canton, grand, basel, national council, women
NZZ, 20min, viktorgiacobbo, Blickch,
tagesanzeiger, srfnews, MikeMuellerLate,
watson_news, migros, srf3
Swiss politics /
women's strike
1044
season, trainer, bundesliga, new arrival, player, dfb, fc, gerest, em, exchanges, estonia, exchange,
transfer, goal, victory, team, wm, cup, liga
DFB_Team, FCBayern, ToniKroos, MarioGoetze,
esmuellert_, Podolski10, Manuel_Neuer,
Bundesliga_DE, ZDFsport, JB17Official German football
767
övp (Austrian party), spö (Austrian party), fpö (Austrian party), vienna, austria, bierlein, oenr,
austria‘s, ibiza, viennese, strache, kickl, turquoise, hofer, parliament, national council, zib2, abdullah,
mandate, election campaign, centre, chancellor (female form), heinz, austrian, glyphosate, proposal,
blue
florianklenk, sebastiankurz, IngridThurnher,
kesslermichael, HannoSettele, vanderbellen,
Gawhary, HBrandstaetter, HHumorlos,
michelreimon Austrian politics
376
dessau, roßlau, afd (German (far-)right-wing party), görlitz, islam, raped, migrants, asylum seeker,
dangerous person, rejected person, sed, wippel, patriots, greta, niger, hosni, strongest, green,
fridayforfuture, african, rosslau, left, gretathunberg, crime, old parties, merkel, radical left, greens,
habeck, keep silent, tear apart, vote, girl, refugee, saxony, rape, maas, citizen, islamistic, sexual
DonJoschi, AfD, MSF_austria, Alice_Weidel,
SteinbachErika, Joerg_Meuthen, Beatrix_vStorch,
GrumpyMerkel, krone_at
hard right /
xenophobia /
migration /
refugees
Tagged Community graph Community graph of communities
in the 3-core of our sample with
over 300 accounts, at least 80 active
accounts during the examined time
frame, and edges with a weight of at
least 150; edge width represents
weight; edge direction follows
clockwise curvature; edges coloured
by source node; node size represents
the number of accounts in each
community
Step 2: Use Botometer1
to identify suspected Bots
1
https://botometer.iuni.iu.edu
Suspected bots cluster mostly outside of topical communities
Suspected bots cluster mostly outside of topical communities
Suspected bots cluster mostly outside of topical communities
Step 3: Rate suspected Bots with a
human rater
Inductive Coding of Suspected Bots
Only 58% of the accounts in the “75% bot” sample
seem to be automated
The majority of automated accounts is benign
Communities of automated accounts
Automation types
Automation Purposes
Preliminary results
● Prevalence of (suspected) automated accounts in line with prior analyses 1
● Low centrality of automated accounts (usually outside central clusters)
● Automated accounts are primarily located outside of politics and news clusters
● Most automated accounts are benign
● Botometer: false-positive rate higher than selected threshold (58% vs. 75% set
threshold) according to human ratings.
● But: good approach to inductively explore account-categories and test detection
methods
Overall, the negative impact of automated accounts seems rather low in our
sample.
1
Varol, O., Ferrara, E., Davis, C. A., Menczer, F., & Flammini, A. (2017). Online Human-Bot Interactions: Detection,
Estimation, and Characterization. In International AAAI Conference on Web and Social Media. Retrieved from
https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/view/15587/14817
(Selected) Limitations / What to do next? / Outlook
This study Other studies In general
Sampling ● Method still
experimental
● Missing peripheral
auto accounts with
low impact (‘bot
swarms’)
Activity/keyword based or
random
● Impact/activity
assessment
problematic
● ‘SEO problem’: bots
optimise on
keywords
● Population unknown
● No concept for
representativity in
networks/media
environments with highly
skewed distributions of
degree/attention
Bot
detection
● Needs triangulation
with other
automated tools
● Needs follow-up
with multiple coders
● ‘Bad Bot’ stigma:
Quiet assumption
that automation is
malign
● Too much trust in
automated tools
● Automated tools often
developed and trained
without a working theory
of social automation
● Binary concept of ‘botness’
instead of
multidimensional
approach to
(semi-)automation
So what makes a bot a bot?
Advertising
+ =
https://github.com/FlxVctr/RADICES
(Continuous Development)
Thanks!
@FlxVctr, @BenAThies, @cbpuschmann, @snurb_dot_info

More Related Content

Similar to What makes a bot a bot? Exploring benign automation on Twitter

Τweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match FixingΤweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match Fixing
Antigoni-Maria Founta
 
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Eirini Ntoutsi
 
Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»
Newreporter.org Sukhacheva
 
"How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako...
"How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako..."How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako...
"How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako...
UNECE Statistics
 
Marsden Disinformation Algorithms #IGF2019
Marsden Disinformation Algorithms #IGF2019 Marsden Disinformation Algorithms #IGF2019
Marsden Disinformation Algorithms #IGF2019
Chris Marsden
 
Extracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual dataExtracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual data
Vasileios Lampos
 
Twitter data mining - social media as a rising customer service platform
Twitter data mining - social media as a rising customer service platformTwitter data mining - social media as a rising customer service platform
Twitter data mining - social media as a rising customer service platform
Sirong Huang
 
FIX the Fixing PROactive quelling of SPORTS EVENTS manipulation
FIX the Fixing PROactive quelling of SPORTS EVENTS manipulationFIX the Fixing PROactive quelling of SPORTS EVENTS manipulation
FIX the Fixing PROactive quelling of SPORTS EVENTS manipulation
apolon1hermes1
 
ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...
ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...
ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...
ISCRAM Events
 
The 2015 European Parliament Digital Trends Survey - Full Results
The 2015 European Parliament Digital Trends Survey - Full ResultsThe 2015 European Parliament Digital Trends Survey - Full Results
The 2015 European Parliament Digital Trends Survey - Full Results
FleishmanHillard Brussels
 
Dream House Essay Example
Dream House Essay ExampleDream House Essay Example
Dream House Essay Example
Wendy Bolden
 
AICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AI
AICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AIAICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AI
AICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AI
FIAT/IFTA
 
Leading digital. #rethinkingmarketing /TweetAtlas Jarkko Jokirinta
Leading digital. #rethinkingmarketing /TweetAtlas Jarkko JokirintaLeading digital. #rethinkingmarketing /TweetAtlas Jarkko Jokirinta
Leading digital. #rethinkingmarketing /TweetAtlas Jarkko JokirintaDagmar
 
Search for Sentiment
Search for SentimentSearch for Sentiment
Search for Sentiment
Seth Grimes
 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use cases
Karol Chlasta
 
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
DATAVERSITY
 
Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...
Giacomo Carozza
 
Best friends forever nina smidt_presentation_25.6.2018_am_cham
Best friends forever  nina smidt_presentation_25.6.2018_am_chamBest friends forever  nina smidt_presentation_25.6.2018_am_cham
Best friends forever nina smidt_presentation_25.6.2018_am_cham
Prof. Dr. Torsten Oltmanns
 
Twitter sentimentanalysis report
Twitter sentimentanalysis reportTwitter sentimentanalysis report
Twitter sentimentanalysis report
Savio Aberneithie
 

Similar to What makes a bot a bot? Exploring benign automation on Twitter (20)

Τweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match FixingΤweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match Fixing
 
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
 
Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»
 
"How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako...
"How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako..."How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako...
"How Social Media Monitoring Can Help Corporate Communication" by Nadine Jako...
 
Marsden Disinformation Algorithms #IGF2019
Marsden Disinformation Algorithms #IGF2019 Marsden Disinformation Algorithms #IGF2019
Marsden Disinformation Algorithms #IGF2019
 
Extracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual dataExtracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual data
 
Twitter data mining - social media as a rising customer service platform
Twitter data mining - social media as a rising customer service platformTwitter data mining - social media as a rising customer service platform
Twitter data mining - social media as a rising customer service platform
 
FIX the Fixing PROactive quelling of SPORTS EVENTS manipulation
FIX the Fixing PROactive quelling of SPORTS EVENTS manipulationFIX the Fixing PROactive quelling of SPORTS EVENTS manipulation
FIX the Fixing PROactive quelling of SPORTS EVENTS manipulation
 
ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...
ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...
ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis ...
 
The 2015 European Parliament Digital Trends Survey - Full Results
The 2015 European Parliament Digital Trends Survey - Full ResultsThe 2015 European Parliament Digital Trends Survey - Full Results
The 2015 European Parliament Digital Trends Survey - Full Results
 
Dream House Essay Example
Dream House Essay ExampleDream House Essay Example
Dream House Essay Example
 
AICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AI
AICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AIAICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AI
AICHROTH Systemaic evaluation and decentralisation for a (bit more) trusted AI
 
Leading digital. #rethinkingmarketing /TweetAtlas Jarkko Jokirinta
Leading digital. #rethinkingmarketing /TweetAtlas Jarkko JokirintaLeading digital. #rethinkingmarketing /TweetAtlas Jarkko Jokirinta
Leading digital. #rethinkingmarketing /TweetAtlas Jarkko Jokirinta
 
Search for Sentiment
Search for SentimentSearch for Sentiment
Search for Sentiment
 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use cases
 
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
 
Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...
 
wendi_ppt
wendi_pptwendi_ppt
wendi_ppt
 
Best friends forever nina smidt_presentation_25.6.2018_am_cham
Best friends forever  nina smidt_presentation_25.6.2018_am_chamBest friends forever  nina smidt_presentation_25.6.2018_am_cham
Best friends forever nina smidt_presentation_25.6.2018_am_cham
 
Twitter sentimentanalysis report
Twitter sentimentanalysis reportTwitter sentimentanalysis report
Twitter sentimentanalysis report
 

Recently uploaded

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
2023240532
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 

Recently uploaded (20)

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 

What makes a bot a bot? Exploring benign automation on Twitter

  • 1. What makes a bot a bot? Exploring benign automation on Twitter Dr. Felix Victor Münch1 ; Ben Thies1 , B.A.; Dr. Cornelius Puschmann1 ; Dr. Axel Bruns2 1 Leibniz Institute for Media Research | Hans-Bredow-Institut (HBI), Germany 2 Digital Media Research Centre (DMRC), Queensland University of Technology (QUT), Australia
  • 4. Background ● Most Twitter-related bot research usually focuses on activity by examining Tweets, Hashtags, Keywords, @-mentions, etc.* ● Most research focuses on malign automation, sometimes even assumes that most automation is malign, and often cannot estimate its impact This research is focused on the Twitter follow network, takes a nuanced approach to automation and assesses bot impact via the Twitter follow network * https://www.pewinternet.org/2018/04/09/bots-in-the-twittersphere/
  • 5. Research Goals ● provide an overview of bot prevalence, influence, and roles of automated accounts in the German-speaking Twittersphere ● develop and test tools/methods that enable such analyses for other languages as well ● do some groundwork for a theory of social automation by cataloguing suspected bots
  • 6. Method 1. Create a language-based (German) Twitter follow network sample of most influential accounts (method preprint: https://bit.ly/twitterwalk) 2. Use Botometer to estimate automation probabilities and apply probability threshold of 0.75 to mark a suspected bot 3. Manually check 100 suspected bots prioritised by Page Rank and code them inductively
  • 7. Step 1: Drawing a Sample
  • 8. Our adaptation of the ‘rank degree’ method More details in our preprint: http://bit.ly/twitterwalk Bottom: Original graph without walked edges. Starting nodes (seeds) are drawn randomly (1) and walker move to their friend with the highest in-degree (2-6). Walked edges get removed/‘burned’. Top: Current sample at each step. Walked (and symmetric) edges are added to sample. 1 2 3 4 5 6
  • 9. Coverage Distribution of public accounts with > 1 friend in the test sample over the percentage of their friends that can be found in the influencer sample (left, filtered for in-degree >= 1, leaving 199,180 accounts) / baseline sample (right, same size, randomly drawn from German accounts in global dataset)
  • 10. (A Sample of) The German Twittersphere ● ~ 1 Mio Accounts, 1.6 Mio. Edges ● 6 Months of collection using API authentication keys of 10 collaborators ● Figure: 3-core of sample network, coloured by communities detected with infomap. Node size indicates Page Rank
  • 11. Keyword extraction with chi-squared criterion active accounts keywords top accounts tag 2015 e3, stream, xd, e32019, nintendo, twitch, game, crossing, pc, zelda, animal, gameplay, cyberpunk2077, games, switch, xbox, trailer, cyberpunk, gaming, xboxe3, uff, nice, awesome, keanu, pk, live, nen, lol, mega unge, dagibee, Gronkh, MelinaSophie, LeFloid, iBlali, Taddl, rewinside, HandIOfIBlood, PietSmiet YouTubers & Gaming 1855 berlin, innen (female suffix), spd (German party), berliner, study, companies, discuss, cdu (party), demand, topics, important, federal government, german, digitisation, topic, has been, annefrank, climate protection, more, june, shows, germany, interview, politics, brandenburg tazgezwitscher, Die_Gruenen, Tagesspiegel, c_lindner, gutjahr, dunjahayali, sigmargabriel, sixtus, HeikoMaas, spdde German politics 1414 women’s strike, switzerland, swiss people, glarner, svp (Swiss party), bern, women’s strike2019, zurich, canton, grand, basel, national council, women NZZ, 20min, viktorgiacobbo, Blickch, tagesanzeiger, srfnews, MikeMuellerLate, watson_news, migros, srf3 Swiss politics / women's strike 1044 season, trainer, bundesliga, new arrival, player, dfb, fc, gerest, em, exchanges, estonia, exchange, transfer, goal, victory, team, wm, cup, liga DFB_Team, FCBayern, ToniKroos, MarioGoetze, esmuellert_, Podolski10, Manuel_Neuer, Bundesliga_DE, ZDFsport, JB17Official German football 767 övp (Austrian party), spö (Austrian party), fpö (Austrian party), vienna, austria, bierlein, oenr, austria‘s, ibiza, viennese, strache, kickl, turquoise, hofer, parliament, national council, zib2, abdullah, mandate, election campaign, centre, chancellor (female form), heinz, austrian, glyphosate, proposal, blue florianklenk, sebastiankurz, IngridThurnher, kesslermichael, HannoSettele, vanderbellen, Gawhary, HBrandstaetter, HHumorlos, michelreimon Austrian politics 376 dessau, roßlau, afd (German (far-)right-wing party), görlitz, islam, raped, migrants, asylum seeker, dangerous person, rejected person, sed, wippel, patriots, greta, niger, hosni, strongest, green, fridayforfuture, african, rosslau, left, gretathunberg, crime, old parties, merkel, radical left, greens, habeck, keep silent, tear apart, vote, girl, refugee, saxony, rape, maas, citizen, islamistic, sexual DonJoschi, AfD, MSF_austria, Alice_Weidel, SteinbachErika, Joerg_Meuthen, Beatrix_vStorch, GrumpyMerkel, krone_at hard right / xenophobia / migration / refugees
  • 12. Tagged Community graph Community graph of communities in the 3-core of our sample with over 300 accounts, at least 80 active accounts during the examined time frame, and edges with a weight of at least 150; edge width represents weight; edge direction follows clockwise curvature; edges coloured by source node; node size represents the number of accounts in each community
  • 13. Step 2: Use Botometer1 to identify suspected Bots 1 https://botometer.iuni.iu.edu
  • 14.
  • 15. Suspected bots cluster mostly outside of topical communities
  • 16. Suspected bots cluster mostly outside of topical communities
  • 17. Suspected bots cluster mostly outside of topical communities
  • 18. Step 3: Rate suspected Bots with a human rater
  • 19. Inductive Coding of Suspected Bots
  • 20. Only 58% of the accounts in the “75% bot” sample seem to be automated
  • 21. The majority of automated accounts is benign
  • 25. Preliminary results ● Prevalence of (suspected) automated accounts in line with prior analyses 1 ● Low centrality of automated accounts (usually outside central clusters) ● Automated accounts are primarily located outside of politics and news clusters ● Most automated accounts are benign ● Botometer: false-positive rate higher than selected threshold (58% vs. 75% set threshold) according to human ratings. ● But: good approach to inductively explore account-categories and test detection methods Overall, the negative impact of automated accounts seems rather low in our sample. 1 Varol, O., Ferrara, E., Davis, C. A., Menczer, F., & Flammini, A. (2017). Online Human-Bot Interactions: Detection, Estimation, and Characterization. In International AAAI Conference on Web and Social Media. Retrieved from https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/view/15587/14817
  • 26. (Selected) Limitations / What to do next? / Outlook This study Other studies In general Sampling ● Method still experimental ● Missing peripheral auto accounts with low impact (‘bot swarms’) Activity/keyword based or random ● Impact/activity assessment problematic ● ‘SEO problem’: bots optimise on keywords ● Population unknown ● No concept for representativity in networks/media environments with highly skewed distributions of degree/attention Bot detection ● Needs triangulation with other automated tools ● Needs follow-up with multiple coders ● ‘Bad Bot’ stigma: Quiet assumption that automation is malign ● Too much trust in automated tools ● Automated tools often developed and trained without a working theory of social automation ● Binary concept of ‘botness’ instead of multidimensional approach to (semi-)automation
  • 27. So what makes a bot a bot?