SlideShare a Scribd company logo
Unsupervised Model for Topic Viewpoint
Discovery in Online Debates Leveraging
Author Interactions
2018/11/8
SAKATA-MORI LAB M1
SYO KYOJIN
Summary
2
【Title and Authors】
- Unsupervised Model for Topic Viewpoint Discovery in
Online Debates Leveraging Author Interactions
- Amine Trabelsi, Osmar R. Za ̈ıane
- ICWSM 2018
【Data】
- Six datasets about four controversial issues, extracted from
4Forums.com (Abbott et al. 2016) and CreateDebate.com
(Hasan and Ng 2014).
【Contributions】
- Propose a purely unsupervised model to detect the
viewpoints at the post and author levels from the post
contents and the authors’ interactions.
- Achieve SOTA in the post level viewpoint identification and
close performance to the supervised SOTA approach in the
author level viewpoint identification.
- Discuss about author level viewpoint clustering.
Backgrounds
3
【Motivation】
- Recently, more and more people become able to to express their opinions on different political and social issues at
social media and online forums.
- Contentious issues : legalization of abortion, same-sex marriage and Donald Trump
- Those data are really useful for decision makers and politicians, but the amount of data is too huge. So an automatic
tool to provide a contrasting overview of the main viewpoints and reasons given by both sides is necessary.
- identify the viewpoints at the post and author’s levels
- detect the relevant discourse used to express opinion and reasoning themes.
Backgrounds
4
【Challenges】
- Applying sentiment analysis techniques on contentious documents is not sufficient to produce an effective solution
to the problem (Hasan and Ng 2013; Walker et al. 2012b).
- Mohammad, Sobhani, and Kiritchenko(2017) show that both positive and negative lexicons are used, in
contentious text, to express the same stance.
- Moreover, the stance can be implicitly conveyed through reasons and arguments, and not necessarily expressed
through polarity sentiment words.
- Challenges to accurate viewpoint detection can arise because of the dialogic nature of online debate (Hasan and Ng
2014; Boltuzˇic ́ and Sˇnajder 2014).
- There are many unstructured and colloquial language in the dialog of online debate i.e., containing non-
argumentative portions and irrelevant dialogs
- The similarity in words’ usage between authors leads to clustering errors when the model is entirely based on textual
features.
- This is often the case when a post rephrases the opposing side’s premise while attacking it or when asking a
rhetorical question.
- It has been shown that complementary features like the nature of the authors’ interactions at post level can
enhance pure text-based approaches in viewpoint distinguishing (Walker et al. 2012b).
Related Works
5
【Category】
- Evaluation series 2016 (SemEval-16) propose a shared task and datasets
for stance detection in Twitter (Mohammad et al. 2016).
- Viewpoint identification can be classified in terms of the type of
- Data : Twitter, Online Debate site
- Feature : Text, Authors’ interaction
- Task : Post level, Author level (Discourse level)
- Methods : Supervised, Unsupervised
【Supervised Learning】
- Sobhani, Inkpen, and Matwin (2015) tackle the viewpoint identification
for news comments using Topic Modeling.
- Hasan and Ng (2013) identify the stance at the post and the sentence
levels of online debates corpora. They construct a rich feature set of
linguistic and semantic features
- Sridhar et al. (2015) model disagreement and collectively predict
stances at the post and the author levels on online debate corpora.
They try different modeling approaches based on Probabilistic Soft Logic
(PSL).
【Examples of Legalization of Abortion】
<AGAINST>
Life, what a beautiful choice. Adoption. Not
abortion. #SemST
<AGAINST>
If you don't want your kid, put it up for
adoption. #sorrynotsorry #SemST
<FAVOR>
Lmao my school thinks giving women the
right to an abortion is against feminism.
They don't know feminism #wtf #SemST
<FAVOR>
@AsaSoltan putting abortion in with choices
for women just won my heart over.
Yaaaaaas! #shahs #SemST
Related works
6
【Unsupervised Learning】
- The most Popular approach is Topic-Viewpoint modeling, which hypothesizes the existence of underlying topic and
viewpoint variables that influence the author’s word choice. There are many different applied targets.
- Opinion polls, editorials (Paul, Zhai, and Girju 2010)
- Online forums (Trabelsi and Za ̈ıane 2016)
- Twitter (Joshi, Bhattacharyya, and Carman 2016)
- Parliamentary debates (Vilares and He 2017).
- Qiu and Jiang (2013) exploit the authors’ interactions to discover stances of posts and cluster authors with different
viewpoints. In their work, the polarity of the interactions between the authors, positive or negative, is guided and
determined using lexicon-based methods.
- Thonet et al. (2017) propose extended versions of Social Network-LDA (SN-LDA) (Sachan et al. 2014) that model the
viewpoint discovery in SNS : the Social Network Viewpoint Discovery Models (SNVDMs) in political Twitter data.
- Dong et al. (2017) focuse on a predicting the author’s stance and propose a weakly guided Stance-based Text
Generative Model with Link Regularization (STML) using the text content and the authors’ interactions in news
comments and online debates. The number of a discussion’s turns, the presence of agreement or disagreement
signals are used to guide.
Method
7
【Model】
- Propose Author Interaction Topic Viewpoint (AITV) Model
- For each word 𝑤 𝑑𝑎 in the post, the author draws a topic 𝑧 𝑛𝑑 from
𝜃 𝑑𝑎, then, samples each word 𝑤 𝑛𝑑 from the topic-viewpoint
distribution corresponding to topic 𝑧 𝑛𝑑 and viewpoint 𝜈 𝑑𝑎 ,
𝜙 𝑧 𝑛𝑑 𝜈 𝑑𝑎
.
- A : one author
- 𝐷 𝑎 : the number of the author’s post
- 𝑁𝑑𝑎 : the number of words in each 𝑑 𝑎
- K : the total number of topics
- L : the total number of viewpoints( = 2)
- 𝜃 𝑑𝑎 : the probability distribution of K topics under a post 𝑑 𝑎
- 𝜓 𝑎 : the probability distributions of viewpoints for an author a
- 𝜈 𝑑𝑎 : a viewpoint chosen from the distribution 𝜓 𝑎
- 𝜙 𝑎 : the multinomial probability distribution over words
associated with a topic k and a viewpoint l.
Method
8
【Model】
- 𝑅𝑏𝑖𝑑 : If the reply attacks the previous author then 𝑅𝑏𝑖𝑑 is set to 1
else if takes 0
- 𝑖𝑑 : a current post with index
- Parent posts : the posts which is the current post’s author is
replying to.
- 𝑅𝑏𝑖𝑑 depends on the degree of opposition between the viewpoint
𝜈𝑖𝑑 of the current post and the viewpoints 𝜈𝑖𝑑
𝑝𝑎𝑟
of its parent posts.
- where I(condition) equals 1 if the condition is true and η is a
smoothing parameter. This modeling of authors interactions is
similar to the users interactions setting presented in (Qiu and Jiang
2013).
Method
9
【Experiment Set Up】
- Remove identical portions of text in replying posts. We remove
stop and rare words. We consider working with the stemmed
version of the words.
- The 4Forums datasets contain the ground truth stance labels at the
author level, while the CreateDebate haven’t.
- Assign to each author the majority label of their
corresponding posts (Sridhar et al. 2015).
- All the reported results of AITV in this section correspond to
aggregation measures on 10 runs or repeats.
- The number of Topics K is 30 unless specified otherwise.
- Hyper parameters are set as follows: α = 0.1; β = 1; γ = 1; η =
0.01.
- The number of the Gibbs Sampling iterations is 1500.
- The words occurring less than 20 times are considered rare
words and are removed.
Results
10
【post level identification】
- AITV clearly outperforms PSL on each of the datasets.
- the best performances are recorded on the largest and
highest connected datasets (% reply posts).
- Compared AITV’s performance against its degenerate version
“AITV-Rebuttal Known” to compare AITV to a close version to
the weakly-guided work of Qiu and Jiang (2013).
- The difference of the knowledge of rebuttal is not
significant.
- This may be due to the automatic initialization based on
authors’ interactions which helps in reducing the variance
of the non-deterministic outputs, due to the Gibbs
Sampling. .
Results
11
【author level identification】
- AITV outperforms its rival unsupervised method SNVDM,
specifically for the datasets containing many interactions. It has
also close to comparable performance with the weakly guided
STML. However, AITV’s performance remains far from that of
the supervised PSL.
- We notice a big drop in accuracies between the post level
and the author level for AITV.
- We evaluate the two unspervised methods with the BCubed F-
Measure. the performances of AITV are overall better than
SNVDM-GPU.
Results
12
【author level identification】
- We evaluate the AITV’s Topic-Viewpoint clustering of the
unigram words.
- inferring the topic of the cluster by the unigram is
difficult.
- the used vocabulary may be very similar.
- In order to perform a better evaluation we choose to use
a representative sentence retrieved from a original data
by each topic-view unigram.
- Even in the same position, example 1 and example 2
are standing on different viewpoints.
- Similary, example 5 and example 7 are standing on
different viewpoints.
THANK YOU FOR LISTENING
ANY DISCUSSION?

More Related Content

What's hot

HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...
HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...
HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...
WarNik Chow
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
Amenda Joy
 
Tweet sentiment analysis (Data mining)
Tweet sentiment analysis (Data mining)Tweet sentiment analysis (Data mining)
Tweet sentiment analysis (Data mining)
Anil Shrestha
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
Ayushi Dalmia
 
Sentiment tool Project presentaion
Sentiment tool Project presentaionSentiment tool Project presentaion
Sentiment tool Project presentaion
Ravindra Chaudhary
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Naveen Kumar
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
Carlos Castillo (ChaTo)
 
Automatic Summarizaton Tutorial
Automatic Summarizaton TutorialAutomatic Summarizaton Tutorial
Automatic Summarizaton Tutorial
Shilpa Subrahmanyam
 
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
IJECEIAES
 
Ontology based sentiment analysis
Ontology based sentiment analysisOntology based sentiment analysis
Ontology based sentiment analysis
prathako
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
TELKOMNIKA JOURNAL
 
Creativity detection in texts
Creativity detection in textsCreativity detection in texts
Creativity detection in texts
University Politehnica Bucharest
 
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Prateek Singh
 
Social Network Analysis [1994]
Social Network Analysis [1994]Social Network Analysis [1994]
Social Network Analysis [1994]
sub-alkhalissi
 
E017433538
E017433538E017433538
E017433538
IOSR Journals
 
Project report
Project reportProject report
Project report
Utkarsh Soni
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
IJSRD
 
Information retrieval 7 boolean model
Information retrieval 7 boolean modelInformation retrieval 7 boolean model
Information retrieval 7 boolean model
Vaibhav Khanna
 
09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memory09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memory
dnac
 
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...Carrie Wang
 

What's hot (20)

HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...
HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...
HashCount for SemEval-2018 Task 3: Concatenative Featurization of Tweet and H...
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Tweet sentiment analysis (Data mining)
Tweet sentiment analysis (Data mining)Tweet sentiment analysis (Data mining)
Tweet sentiment analysis (Data mining)
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
 
Sentiment tool Project presentaion
Sentiment tool Project presentaionSentiment tool Project presentaion
Sentiment tool Project presentaion
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
 
Automatic Summarizaton Tutorial
Automatic Summarizaton TutorialAutomatic Summarizaton Tutorial
Automatic Summarizaton Tutorial
 
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
 
Ontology based sentiment analysis
Ontology based sentiment analysisOntology based sentiment analysis
Ontology based sentiment analysis
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
 
Creativity detection in texts
Creativity detection in textsCreativity detection in texts
Creativity detection in texts
 
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
 
Social Network Analysis [1994]
Social Network Analysis [1994]Social Network Analysis [1994]
Social Network Analysis [1994]
 
E017433538
E017433538E017433538
E017433538
 
Project report
Project reportProject report
Project report
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
 
Information retrieval 7 boolean model
Information retrieval 7 boolean modelInformation retrieval 7 boolean model
Information retrieval 7 boolean model
 
09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memory09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memory
 
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
New Quantitative Methodology for Identification of Drug Abuse Based on Featur...
 

Similar to Research on AITV

H046025258
H046025258H046025258
H046025258
IJERA Editor
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
NBER
 
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
ijiert bestjournal
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papersAshish Kulkarni
 
Evolving Swings (topics) from Social Streams using Probability Model
Evolving Swings (topics) from Social Streams using Probability ModelEvolving Swings (topics) from Social Streams using Probability Model
Evolving Swings (topics) from Social Streams using Probability Model
IJERA Editor
 
Blog track
Blog trackBlog track
Blog track
anilcs0405
 
opinion feature extraction using enhanced opinion mining technique and intrin...
opinion feature extraction using enhanced opinion mining technique and intrin...opinion feature extraction using enhanced opinion mining technique and intrin...
opinion feature extraction using enhanced opinion mining technique and intrin...
INFOGAIN PUBLICATION
 
Scholar's scientific collaboration through co-authorship networks
Scholar's scientific collaboration through co-authorship networksScholar's scientific collaboration through co-authorship networks
Scholar's scientific collaboration through co-authorship networks
Aliakbar Akbaritabar (Ali)
 
Graph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social NetworkGraph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social Network
Khan Mostafa
 
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsEMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
Parang Saraf
 
An Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation GenerationAn Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation Generation
Kayla Jones
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
20211a05p7
 
Cwpa 2016 comparative revision writing
Cwpa 2016 comparative revision writingCwpa 2016 comparative revision writing
Cwpa 2016 comparative revision writing
macktial
 
A comparative sentiment analysis of human, Gary Fisher
A comparative sentiment analysis of human, Gary FisherA comparative sentiment analysis of human, Gary Fisher
A comparative sentiment analysis of human, Gary Fisher
SEDA
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3Dave King
 
Sergey Nikolenko and Anton Alekseev User Profiling in Text-Based Recommende...
Sergey Nikolenko and  Anton Alekseev  User Profiling in Text-Based Recommende...Sergey Nikolenko and  Anton Alekseev  User Profiling in Text-Based Recommende...
Sergey Nikolenko and Anton Alekseev User Profiling in Text-Based Recommende...
AIST
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
Journal For Research
 
Interaction Design Patterns in Recommender Systems
Interaction Design Patterns in Recommender SystemsInteraction Design Patterns in Recommender Systems
Interaction Design Patterns in Recommender Systems
University of Bergen
 
Aq35241246
Aq35241246Aq35241246
Aq35241246
IJERA Editor
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel Reviews
Kimberly Pulley
 

Similar to Research on AITV (20)

H046025258
H046025258H046025258
H046025258
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
 
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papers
 
Evolving Swings (topics) from Social Streams using Probability Model
Evolving Swings (topics) from Social Streams using Probability ModelEvolving Swings (topics) from Social Streams using Probability Model
Evolving Swings (topics) from Social Streams using Probability Model
 
Blog track
Blog trackBlog track
Blog track
 
opinion feature extraction using enhanced opinion mining technique and intrin...
opinion feature extraction using enhanced opinion mining technique and intrin...opinion feature extraction using enhanced opinion mining technique and intrin...
opinion feature extraction using enhanced opinion mining technique and intrin...
 
Scholar's scientific collaboration through co-authorship networks
Scholar's scientific collaboration through co-authorship networksScholar's scientific collaboration through co-authorship networks
Scholar's scientific collaboration through co-authorship networks
 
Graph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social NetworkGraph-based Analysis and Opinion Mining in Social Network
Graph-based Analysis and Opinion Mining in Social Network
 
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest EventsEMBERS AutoGSR: Automated Coding of Civil Unrest Events
EMBERS AutoGSR: Automated Coding of Civil Unrest Events
 
An Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation GenerationAn Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation Generation
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
 
Cwpa 2016 comparative revision writing
Cwpa 2016 comparative revision writingCwpa 2016 comparative revision writing
Cwpa 2016 comparative revision writing
 
A comparative sentiment analysis of human, Gary Fisher
A comparative sentiment analysis of human, Gary FisherA comparative sentiment analysis of human, Gary Fisher
A comparative sentiment analysis of human, Gary Fisher
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
 
Sergey Nikolenko and Anton Alekseev User Profiling in Text-Based Recommende...
Sergey Nikolenko and  Anton Alekseev  User Profiling in Text-Based Recommende...Sergey Nikolenko and  Anton Alekseev  User Profiling in Text-Based Recommende...
Sergey Nikolenko and Anton Alekseev User Profiling in Text-Based Recommende...
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
 
Interaction Design Patterns in Recommender Systems
Interaction Design Patterns in Recommender SystemsInteraction Design Patterns in Recommender Systems
Interaction Design Patterns in Recommender Systems
 
Aq35241246
Aq35241246Aq35241246
Aq35241246
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel Reviews
 

Recently uploaded

Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..
SocioCosmos
 
Project Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docxProject Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docx
zeqirielmedina8
 
Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......
SocioCosmos
 
SluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final ProposalSluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final Proposal
grogshiregames
 
HOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa CreditoHOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa Credito
ClarissaAlanoCredito
 
Transform Your Presence Now!..............
Transform Your Presence Now!..............Transform Your Presence Now!..............
Transform Your Presence Now!..............
SocioCosmos
 
Your Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts HereYour Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts Here
SocioCosmos
 
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
ryxqoswi
 
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
AJHSSR Journal
 
Social Media Marketing Strategies .
Social Media Marketing Strategies                     .Social Media Marketing Strategies                     .
Social Media Marketing Strategies .
Virtual Real Design
 
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing AgencyThe Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
Digital Marketing Lab
 
SluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor ProposalSluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor Proposal
grogshiregames
 
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANEEASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
Febless Hernane
 
Surat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculumSurat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculum
digitalcourseshop4
 
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
AJHSSR Journal
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAMLORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
lorraineandreiamcidl
 
HOW TO USE THREADS an Instagram App_ by Clarissa Credito
HOW TO USE THREADS an Instagram App_ by Clarissa CreditoHOW TO USE THREADS an Instagram App_ by Clarissa Credito
HOW TO USE THREADS an Instagram App_ by Clarissa Credito
ClarissaAlanoCredito
 

Recently uploaded (17)

Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..
 
Project Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docxProject Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docx
 
Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......
 
SluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final ProposalSluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final Proposal
 
HOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa CreditoHOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa Credito
 
Transform Your Presence Now!..............
Transform Your Presence Now!..............Transform Your Presence Now!..............
Transform Your Presence Now!..............
 
Your Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts HereYour Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts Here
 
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
 
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
 
Social Media Marketing Strategies .
Social Media Marketing Strategies                     .Social Media Marketing Strategies                     .
Social Media Marketing Strategies .
 
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing AgencyThe Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
 
SluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor ProposalSluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor Proposal
 
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANEEASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
 
Surat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculumSurat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculum
 
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAMLORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
 
HOW TO USE THREADS an Instagram App_ by Clarissa Credito
HOW TO USE THREADS an Instagram App_ by Clarissa CreditoHOW TO USE THREADS an Instagram App_ by Clarissa Credito
HOW TO USE THREADS an Instagram App_ by Clarissa Credito
 

Research on AITV

  • 1. Unsupervised Model for Topic Viewpoint Discovery in Online Debates Leveraging Author Interactions 2018/11/8 SAKATA-MORI LAB M1 SYO KYOJIN
  • 2. Summary 2 【Title and Authors】 - Unsupervised Model for Topic Viewpoint Discovery in Online Debates Leveraging Author Interactions - Amine Trabelsi, Osmar R. Za ̈ıane - ICWSM 2018 【Data】 - Six datasets about four controversial issues, extracted from 4Forums.com (Abbott et al. 2016) and CreateDebate.com (Hasan and Ng 2014). 【Contributions】 - Propose a purely unsupervised model to detect the viewpoints at the post and author levels from the post contents and the authors’ interactions. - Achieve SOTA in the post level viewpoint identification and close performance to the supervised SOTA approach in the author level viewpoint identification. - Discuss about author level viewpoint clustering.
  • 3. Backgrounds 3 【Motivation】 - Recently, more and more people become able to to express their opinions on different political and social issues at social media and online forums. - Contentious issues : legalization of abortion, same-sex marriage and Donald Trump - Those data are really useful for decision makers and politicians, but the amount of data is too huge. So an automatic tool to provide a contrasting overview of the main viewpoints and reasons given by both sides is necessary. - identify the viewpoints at the post and author’s levels - detect the relevant discourse used to express opinion and reasoning themes.
  • 4. Backgrounds 4 【Challenges】 - Applying sentiment analysis techniques on contentious documents is not sufficient to produce an effective solution to the problem (Hasan and Ng 2013; Walker et al. 2012b). - Mohammad, Sobhani, and Kiritchenko(2017) show that both positive and negative lexicons are used, in contentious text, to express the same stance. - Moreover, the stance can be implicitly conveyed through reasons and arguments, and not necessarily expressed through polarity sentiment words. - Challenges to accurate viewpoint detection can arise because of the dialogic nature of online debate (Hasan and Ng 2014; Boltuzˇic ́ and Sˇnajder 2014). - There are many unstructured and colloquial language in the dialog of online debate i.e., containing non- argumentative portions and irrelevant dialogs - The similarity in words’ usage between authors leads to clustering errors when the model is entirely based on textual features. - This is often the case when a post rephrases the opposing side’s premise while attacking it or when asking a rhetorical question. - It has been shown that complementary features like the nature of the authors’ interactions at post level can enhance pure text-based approaches in viewpoint distinguishing (Walker et al. 2012b).
  • 5. Related Works 5 【Category】 - Evaluation series 2016 (SemEval-16) propose a shared task and datasets for stance detection in Twitter (Mohammad et al. 2016). - Viewpoint identification can be classified in terms of the type of - Data : Twitter, Online Debate site - Feature : Text, Authors’ interaction - Task : Post level, Author level (Discourse level) - Methods : Supervised, Unsupervised 【Supervised Learning】 - Sobhani, Inkpen, and Matwin (2015) tackle the viewpoint identification for news comments using Topic Modeling. - Hasan and Ng (2013) identify the stance at the post and the sentence levels of online debates corpora. They construct a rich feature set of linguistic and semantic features - Sridhar et al. (2015) model disagreement and collectively predict stances at the post and the author levels on online debate corpora. They try different modeling approaches based on Probabilistic Soft Logic (PSL). 【Examples of Legalization of Abortion】 <AGAINST> Life, what a beautiful choice. Adoption. Not abortion. #SemST <AGAINST> If you don't want your kid, put it up for adoption. #sorrynotsorry #SemST <FAVOR> Lmao my school thinks giving women the right to an abortion is against feminism. They don't know feminism #wtf #SemST <FAVOR> @AsaSoltan putting abortion in with choices for women just won my heart over. Yaaaaaas! #shahs #SemST
  • 6. Related works 6 【Unsupervised Learning】 - The most Popular approach is Topic-Viewpoint modeling, which hypothesizes the existence of underlying topic and viewpoint variables that influence the author’s word choice. There are many different applied targets. - Opinion polls, editorials (Paul, Zhai, and Girju 2010) - Online forums (Trabelsi and Za ̈ıane 2016) - Twitter (Joshi, Bhattacharyya, and Carman 2016) - Parliamentary debates (Vilares and He 2017). - Qiu and Jiang (2013) exploit the authors’ interactions to discover stances of posts and cluster authors with different viewpoints. In their work, the polarity of the interactions between the authors, positive or negative, is guided and determined using lexicon-based methods. - Thonet et al. (2017) propose extended versions of Social Network-LDA (SN-LDA) (Sachan et al. 2014) that model the viewpoint discovery in SNS : the Social Network Viewpoint Discovery Models (SNVDMs) in political Twitter data. - Dong et al. (2017) focuse on a predicting the author’s stance and propose a weakly guided Stance-based Text Generative Model with Link Regularization (STML) using the text content and the authors’ interactions in news comments and online debates. The number of a discussion’s turns, the presence of agreement or disagreement signals are used to guide.
  • 7. Method 7 【Model】 - Propose Author Interaction Topic Viewpoint (AITV) Model - For each word 𝑤 𝑑𝑎 in the post, the author draws a topic 𝑧 𝑛𝑑 from 𝜃 𝑑𝑎, then, samples each word 𝑤 𝑛𝑑 from the topic-viewpoint distribution corresponding to topic 𝑧 𝑛𝑑 and viewpoint 𝜈 𝑑𝑎 , 𝜙 𝑧 𝑛𝑑 𝜈 𝑑𝑎 . - A : one author - 𝐷 𝑎 : the number of the author’s post - 𝑁𝑑𝑎 : the number of words in each 𝑑 𝑎 - K : the total number of topics - L : the total number of viewpoints( = 2) - 𝜃 𝑑𝑎 : the probability distribution of K topics under a post 𝑑 𝑎 - 𝜓 𝑎 : the probability distributions of viewpoints for an author a - 𝜈 𝑑𝑎 : a viewpoint chosen from the distribution 𝜓 𝑎 - 𝜙 𝑎 : the multinomial probability distribution over words associated with a topic k and a viewpoint l.
  • 8. Method 8 【Model】 - 𝑅𝑏𝑖𝑑 : If the reply attacks the previous author then 𝑅𝑏𝑖𝑑 is set to 1 else if takes 0 - 𝑖𝑑 : a current post with index - Parent posts : the posts which is the current post’s author is replying to. - 𝑅𝑏𝑖𝑑 depends on the degree of opposition between the viewpoint 𝜈𝑖𝑑 of the current post and the viewpoints 𝜈𝑖𝑑 𝑝𝑎𝑟 of its parent posts. - where I(condition) equals 1 if the condition is true and η is a smoothing parameter. This modeling of authors interactions is similar to the users interactions setting presented in (Qiu and Jiang 2013).
  • 9. Method 9 【Experiment Set Up】 - Remove identical portions of text in replying posts. We remove stop and rare words. We consider working with the stemmed version of the words. - The 4Forums datasets contain the ground truth stance labels at the author level, while the CreateDebate haven’t. - Assign to each author the majority label of their corresponding posts (Sridhar et al. 2015). - All the reported results of AITV in this section correspond to aggregation measures on 10 runs or repeats. - The number of Topics K is 30 unless specified otherwise. - Hyper parameters are set as follows: α = 0.1; β = 1; γ = 1; η = 0.01. - The number of the Gibbs Sampling iterations is 1500. - The words occurring less than 20 times are considered rare words and are removed.
  • 10. Results 10 【post level identification】 - AITV clearly outperforms PSL on each of the datasets. - the best performances are recorded on the largest and highest connected datasets (% reply posts). - Compared AITV’s performance against its degenerate version “AITV-Rebuttal Known” to compare AITV to a close version to the weakly-guided work of Qiu and Jiang (2013). - The difference of the knowledge of rebuttal is not significant. - This may be due to the automatic initialization based on authors’ interactions which helps in reducing the variance of the non-deterministic outputs, due to the Gibbs Sampling. .
  • 11. Results 11 【author level identification】 - AITV outperforms its rival unsupervised method SNVDM, specifically for the datasets containing many interactions. It has also close to comparable performance with the weakly guided STML. However, AITV’s performance remains far from that of the supervised PSL. - We notice a big drop in accuracies between the post level and the author level for AITV. - We evaluate the two unspervised methods with the BCubed F- Measure. the performances of AITV are overall better than SNVDM-GPU.
  • 12. Results 12 【author level identification】 - We evaluate the AITV’s Topic-Viewpoint clustering of the unigram words. - inferring the topic of the cluster by the unigram is difficult. - the used vocabulary may be very similar. - In order to perform a better evaluation we choose to use a representative sentence retrieved from a original data by each topic-view unigram. - Even in the same position, example 1 and example 2 are standing on different viewpoints. - Similary, example 5 and example 7 are standing on different viewpoints.
  • 13. THANK YOU FOR LISTENING ANY DISCUSSION?

Editor's Notes

  1. Rhetorical question 反語
  2. Propose a sharedSemEval-16ではデータ All these described methods extensively rely on human annotations, which are expensive to obtain, and on supervision which does not guar- antee scaling to different domains and types of data.
  3. 世論調査や社説欄 Most of the Topic-Viewpoint models do not necessarily attempt to cluster the posts or their authors. different extended versions of Social Network-LDA (SN-LDA) (Sachan et al. 2014) ポリアのツボ The SNVDM-GPU, the version based on Generalized Polya Urn sampling, is performing the best among all degenerate versions. It aims to overcome the sparsity of interactions.
  4. 投稿者のワードのチョイスは トピック +