Hmm.. Why is the new
tax bill controversial?
1. Motivation
• Online controversies often emerge and evolve quickly due to the nature of
social media.
• Navigating social media platforms to learn about a new controversy is an
overwhelming task.
• Users have to linearly scan postings to understand two conflicting stances
while fighting against:
• Filter bubble phenomenon
• A lot of noisy postings
2. Problem Definition
3. What makes a tweet a good summary?
4. Probabilistic Ranking Model 5. Experimental Setup
6. Evaluation and conclusion
Given a controversial topic, rank tweets to find a summary of k tweets that
best explains why the topic is controversial.
We assume that there are always two conflicting stances,
namely stance A and stance B, for a controversial topic, and
aim to generate a summary around those stances.
Application Scenario
• We know it’s not okay that for 40 yrs politicians have denied a woman
coverage of abortion just because she’s poor #BoldTheVote #BeBoldEndHyde
Dataset
Topic            tweet2Vec              summary
                 # tweets   # users     # tweets   RT ratio
Election         10.8M      4.3M        10,000     70.9%
#TakeAKnee       565K       692K        44,167     71.1%
Abortion         692K       539K        3,477      57.6%
Feminism         1.7M       1.7M        50,323     41.3%
Climate Change   546K       360K        10,234     60.1%
Methods
• Random
• MostRT
• SumBasic (Nenkova and Vanderwende, 2005)
• SumSAT (-C)
• HashtagSumSAT (-C)
Evaluation
• Asked 10 people on AMT, in a blind fashion, which of two summaries (one by
method A, one by method B) better explains the controversy.
[Figure: fraction of times preferred (y-axis, 0.0 to 1.0) by topic: Abortion (2016/11/4), Election (2016/10/14), Feminism (2016/3/8), Climate Change (2016/11/30), US Anthem Protest (2017/09/30). Legend: Random, MostRT, SumBasic-C, SumSAT, SumSAT-C, HashtagSumSAT.]
• Don’t support #RapeCulture by calling it #LockerroomTalk
• If you’re so pro life then go and f****ing get one?
Good summary tweets are characterized by a clear stance on the
controversial aspect of the topic, expressed in assertive, non-vulgar
language and supported by “stance hashtags”, while being relevant.
We propose that a tweet is likely to be part of a good summary if it is:
• Stance-indicative (estimated with a probability model)
• Articulate (estimated with a regression model)
• Topic-relevant (estimated with a language model)
The three estimates are combined into a single score.
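The poster does not spell out how the three estimates are combined into the score. The sketch below assumes, purely for illustration, that they are treated as independent probabilities and multiplied. The names `TweetScores`, `summary_score` and `rank` are hypothetical and not taken from the authors' code.

```python
from dataclasses import dataclass


@dataclass
class TweetScores:
    text: str
    p_stance: float      # hypothetical output of the stance probability model
    p_articulate: float  # hypothetical output of the articulation regression model
    p_relevant: float    # hypothetical output of the topic language model


def summary_score(t: TweetScores) -> float:
    # Assumption: the three estimates are independent, so they are multiplied.
    return t.p_stance * t.p_articulate * t.p_relevant


def rank(tweets):
    # Highest-scoring tweets first.
    return sorted(tweets, key=summary_score, reverse=True)
```

Under this assumption a tweet needs to do reasonably well on all three criteria: a single near-zero estimate pulls the whole score down.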
4.1 Estimating Stance Indication
• Retweet communities are disconnected for
controversial topics (Garimella et al., 2016)
• Define stance hashtags as the hashtags that
are used exclusively in one of the retweet communities
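As a rough sketch of the definition above, assume the two retweet communities have already been found and are given as two lists of tweet texts. The function names and the `min_count` threshold are assumptions added for illustration; the poster does not mention a frequency cut-off.

```python
import re
from collections import Counter

HASHTAG = re.compile(r"#\w+")


def hashtags(text):
    # Lower-cased set of hashtags appearing in one tweet.
    return {h.lower() for h in HASHTAG.findall(text)}


def stance_hashtags(community_a, community_b, min_count=2):
    # Count in how many tweets of each community a hashtag appears.
    count_a = Counter(h for t in community_a for h in hashtags(t))
    count_b = Counter(h for t in community_b for h in hashtags(t))
    # Keep hashtags seen in one community and never in the other.
    # min_count is a hypothetical filter against one-off hashtags.
    only_a = {h for h, c in count_a.items() if c >= min_count and h not in count_b}
    only_b = {h for h, c in count_b.items() if c >= min_count and h not in count_a}
    return only_a, only_b
```

A hashtag that names the topic itself tends to appear in both communities and is therefore dropped, which is the intended effect of the exclusivity rule.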
[Diagram: extract stance hashtags from the two retweet communities (stance A and stance B), then estimate for each tweet the probability that it has hashtags of stance A and the probability that it has hashtags of stance B.]
Estimating Latent Hashtag Probability
• Hashtags are incomplete user-annotated labels.
• The absence of a hashtag doesn’t necessarily mean that the hashtag isn’t the
right label.
• We trained tweet2vec (Dhingra et al., 2016) to obtain embeddings of the
tweets and hashtags and to estimate the probability P(h|t) for all hashtags.
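One way to picture the estimate of P(h|t) is sketched below. It assumes the tweet and hashtag embeddings are already available as plain lists of floats and that P(h|t) is a softmax over their dot products. Summing the probability mass over a stance's hashtags to get a per-stance probability is a further assumption, not something stated on the poster.

```python
import math


def hashtag_posterior(tweet_vec, hashtag_vecs):
    # Dot product between the tweet embedding and each hashtag embedding.
    logits = {
        h: sum(a * b for a, b in zip(tweet_vec, vec))
        for h, vec in hashtag_vecs.items()
    }
    # Numerically stable softmax over all hashtags gives P(h | t).
    top = max(logits.values())
    exps = {h: math.exp(z - top) for h, z in logits.items()}
    total = sum(exps.values())
    return {h: e / total for h, e in exps.items()}


def stance_probability(posterior, stance_tags):
    # Assumption: the stance probability is the mass on that stance's hashtags.
    return sum(p for h, p in posterior.items() if h in stance_tags)
```

Because every hashtag receives some probability, a tweet that carries no hashtag at all can still be assigned to a stance.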
4.2 Estimating Articulation
• Trained a logistic regression model to predict how well-written a tweet is.
• Labeled 300 tweets with a binary label (articulate vs. non-articulate).
• Features: the ratios of POS tags, OOV words, offensive words, POS-tag n-grams,
and stop words in the tweet, as well as tweet length and average word length.
4.3 Summary Generation
• SumSAT ranks the tweets by the score of the ranking function and takes the top
k/2 tweets for each of the two conflicting stances as the summary.
• HashtagSumSAT takes the top k/2 stance hashtags and finds the top summary
tweet among those that contain the given stance hashtag.
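The two selection strategies, SumSAT and HashtagSumSAT, can be sketched as follows. The sketch assumes each tweet is a dict with hypothetical keys `text`, `stance` ("A" or "B") and `score`, and that the stance hashtags arrive already ranked and lower-cased. How the hashtags are ranked, and the choice to skip a tweet that was already selected, are assumptions not covered by the poster.

```python
def sumsat(tweets, k):
    # Take the k/2 highest-scoring tweets for each of the two stances.
    summary = []
    for stance in ("A", "B"):
        pool = [t for t in tweets if t["stance"] == stance]
        pool.sort(key=lambda t: t["score"], reverse=True)
        summary.extend(t["text"] for t in pool[: k // 2])
    return summary


def hashtag_sumsat(tweets, ranked_tags, k):
    # ranked_tags: {"A": [...], "B": [...]}, best hashtag first (assumed input).
    summary = []
    for stance in ("A", "B"):
        for tag in ranked_tags[stance][: k // 2]:
            # Candidate tweets contain the hashtag and are not yet selected.
            pool = [
                t for t in tweets
                if tag in t["text"].lower().split() and t["text"] not in summary
            ]
            if pool:
                summary.append(max(pool, key=lambda t: t["score"])["text"])
    return summary
```

SumSAT picks tweets directly by score, while HashtagSumSAT picks hashtags first and then one representative tweet per hashtag, so its summary is spread over different hashtags by construction.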
Myungha Jang and James Allan
Center for Intelligent Information Retrieval, University of Massachusetts Amherst
Explaining Controversy on Social Media via Stance Summarization
• Hashtag-based summarization seems to be more effective for event-based
controversies.
[Figure: fraction of times each method's summary was preferred]
Method            Fraction of times preferred
Random            0.42
MostRT            0.42
SumBasic-C        0.26
SumBasic          0.08
SumSAT-C          0.54
SumSAT            0.68
HashtagSumSAT-C   0.61
HashtagSumSAT     0.59
• SumSAT generated the summaries that were preferred most often,
followed by HashtagSumSAT-C.
• A normal summarization technique fails: controversy summarization
is a new task.
• Social features are more useful than content features.
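The "fraction of times preferred" metric reported above can be illustrated with a small sketch. It assumes each annotator judgment is recorded as a triple (method A, method B, winner) and that a method's fraction is its wins divided by the number of comparisons it took part in. The poster does not give the exact computation, so this is one plausible reading.

```python
from collections import Counter


def fraction_preferred(judgments):
    # judgments: iterable of (method_a, method_b, winner) triples.
    shown, won = Counter(), Counter()
    for method_a, method_b, winner in judgments:
        shown[method_a] += 1
        shown[method_b] += 1
        won[winner] += 1
    # Wins divided by the number of comparisons the method appeared in.
    return {m: won[m] / shown[m] for m in shown}
```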
Original figure source: Garimella et al. (2015), Quantifying Controversy on Social Media
Sample tweets on “Abortion” on Oct 21, 2017
