SlideShare a Scribd company logo
Expressing Opinion Diversity

Andreea Bizău – Babeș Bolyai University,
Cluj-Napoca, Romania
Delia Rusu, Dunja Mladenić - Jožef Stefan
Institute, Ljubljana, Slovenia


                                  ailab.ijs.si
Motivation: Opinion Diversity




                       ailab.ijs.si
Overview

Terminology
Related work
Domain Driven Opinion Vocabulary
Use Case:
  Twitter Movie Comments
Conclusions and Future Work




                                   ailab.ijs.si
Terminology
opinion - a subjective expression of sentiments,
appraisals or feelings
opinion words - a set of keywords/phrases used
in expressing an opinion
orientation of an opinion word indicates whether
the opinion expressed is positive, negative or
neutral
the totality of opinion words forms an opinion
vocabulary


                                     ailab.ijs.si
Related Work – Opinion
                  Vocabulary
Dictionary based
  Esuli and Sebastiani, 2006 - SentiWordNet: three
  numerical scores (objective, positive, negative)
Corpus based
  Kanayama and Nasukawa, 2006 - context coherency
  (same polarity tend to appear successively)
  Jialin Pan et al, 2010 - feature bipartite graph modeling the
  relationship between domain-specific words and domain-
  independent words.
Dictionary and Corpus based
  Jijkoun et al, 2010 - dependency parsing on a set of
  relevant documents, extracting patterns of the form (clue
  word, syntactic context, target of sentiment)

                                                 ailab.ijs.si
Related Work – Opinion Analysis
Approaches:
  Hatzivassiloglou et al, 1997 – relevance of using
  connectives (conjunctions: and, or, but, etc)
  Kim and Hovy, 2004 – use word seed lists and WordNet
  synsets to determine the strength of the opinion orientation
  for the identified opinion words
  Gamon and Aue, 2005 – assign opinion orientation to
  candidate words, assuming that opinion terms with similar
  orientation tend to co-occur
Twitter Applications:
  Pang and Lee, 2002 – classifying movie reviews by overall
  document sentiment
  Asur and Huberman, 2010 – sentiments extracted from
  Twitter can be used to build a prediction model for box-
  office revenue

                                                ailab.ijs.si
Our Approach
                                                Domain-specific
         IMDb Movie reviews                     opinion
             (sample)                           vocabulary

                                                  Weird, odd,
              2 Clusters                             bad
                Weird,
                 odd       IMDb Movie reviews
                                                   amazing,
               amazing,      (Training data)       awesome,
               awesome                               perfect,
                                                    fantastic

Vocabulary                                                Twitter comments
                                                               analysis
             applied to
                           Movie tweets
                            (Test data)




                                                                ailab.ijs.si
Domain Driven Opinion
                      Vocabulary
Given a positive word seed list and
a negative word seed list, expand
the initial seed lists using synonymy                          Domain-specific
/ antonymy relations (WordNet)                                 opinion
                                                               vocabulary
   the initial words will be assigned a
   score of 1 for positive words and -1 for
   negative words                                                Weird, odd,
Given a corpus, parse and extract                                   bad
                                              2 Clusters
all adjectives and conjunctions –              Weird,
obtain a graph with two types of                odd               amazing,
relationships between nodes:                                      awesome,
                                              amazing,
   same context orientation (words            awesome               perfect,
   connected by and, or, nor) or                                   fantastic
   opposite context orientation (words
   connected by but, yet)
Clean the resulting set of words and
relationship graph by removing stop
words and self-reference relations
                                                           ailab.ijs.si
Domain Driven Opinion
                      Vocabulary
 Some of the characters are fictitious, but not grotesque
 synonym

  fictional
                                          fictitious
    real                                score(“fictitious”) = max(s_syn, s_ant)
                                        score(“fictitious”) = s_syn
 antonym                                score(“fictitious”) = - 0.3874 if f = 0.9

fictitious but grotesque – ContextOpposite relationship

fictitious but not grotesque – ContextSame relationship


                                                              ailab.ijs.si
Domain Driven Opinion
                   Vocabulary
4. Propagate scores through the graph, by
   determining for each word w
     Positivity score sPos
     Negativity score sNeg

      if relij is a ContextSame relation
          sPos(wi) += weigth(relij) * prevSPos(wj)
          sNeg (wi) += weigth(relij) * prevSNeg(wj)
      else if relij is a ContextOpposite relation
          sPos(wi) += weigth(relij) * prevSNeg(wj)
          sNeg (wi) += weigth(relij) * prevSPos(wj)



                                                      ailab.ijs.si
Use Case: Twitter Movie Reviews

domain specific document corpus of 27,886 IMDb
movie reviews
domain specific opinion vocabulary:
   9,318 words: 4,925 have a negative orientation and 4393
   have a positive orientation

            Inception (2010)                  Meet the Spartans (2008)
Positive words: good, great,           Positive words: funny, awesome,
awesome, amazing, favorite, fantastic, great
incredible, thrilling, different,
speechless                             Negative words: bad, stupid, dumb,
                                       weird, silly, common, ridiculous,
Negative words: bad, confusing,        terrible
weird, stupid, dumb, boring,
predictable, horrible, disappointing

                                                              ailab.ijs.si
Use Case: Twitter Movie Reviews

220,387 tweets crawled over a two month
interval, keyed on 84 movies
     Movie               Genre           Tweets   IMDb       Our
                                                  score     score
Inception (2010)      mystery, sci-fi,   19,256    8.9      66.52
                         thriller
  Megamind         animation, comedy,    8,109     7.3      67.71
    (2010)               family
 Unstoppable          drama, thriller    15,349    7        63.67
    (2010)
  Burlesque          drama, music,       1,244     6.2      70.78
    (2010)              romance
   Meet the           comedy, war          44      2.5      40.67
Spartans (2008)

                                                         ailab.ijs.si
Twitter comments
          analysis

•   Sentiment words
    distribution for a movie
•   Sentiment orientation
    evolution per week, day,
    hour
•   Movie comparison




       ailab.ijs.si
Conclusions and Future Work
identifying opinion diversity expressed within text,
with the aid of a domain-specific vocabulary
processing a corpus of IMDb movie reviews,
generated an opinion lexicon and analyzed a
different opinion source corpus, i.e. a tweet collection

further extend our algorithm to include opinion words
expressed by verbs and adverbs, as well as more
complex expressions
conduct experiments in order to
  determine the correlation between positive opinion words
  for a given movie and the IMDb movie rating
  evaluate the opinion lexicon directly

                                              ailab.ijs.si
Thank You for Your Attention!




                       ailab.ijs.si

More Related Content

More from RENDER project

Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
RENDER project
 
Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja TrampusDiversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus
RENDER project
 
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
RENDER project
 
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
RENDER project
 
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny VrandecicDiversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
RENDER project
 
Diversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlDiversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena Simperl
RENDER project
 
Data Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementData Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data Management
RENDER project
 
Diversity toolkit
Diversity toolkitDiversity toolkit
Diversity toolkit
RENDER project
 
RENDER Telefonica
RENDER TelefonicaRENDER Telefonica
RENDER Telefonica
RENDER project
 
Defining Diversity
Defining DiversityDefining Diversity
Defining Diversity
RENDER project
 
Render Project introduction and overview
Render Project introduction and overviewRender Project introduction and overview
Render Project introduction and overview
RENDER project
 

More from RENDER project (11)

Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
 
Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja TrampusDiversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus
 
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
 
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
 
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny VrandecicDiversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
 
Diversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlDiversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena Simperl
 
Data Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementData Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data Management
 
Diversity toolkit
Diversity toolkitDiversity toolkit
Diversity toolkit
 
RENDER Telefonica
RENDER TelefonicaRENDER Telefonica
RENDER Telefonica
 
Defining Diversity
Defining DiversityDefining Diversity
Defining Diversity
 
Render Project introduction and overview
Render Project introduction and overviewRender Project introduction and overview
Render Project introduction and overview
 

Recently uploaded

“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
Claudio Di Ciccio
 
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdfAI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
Techgropse Pvt.Ltd.
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
FODUU
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 

Recently uploaded (20)

“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
 
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdfAI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 

Diversiweb2011 04 Expressing Opinion Diversity - Delia Rusu

  • 1. Expressing Opinion Diversity Andreea Bizău – Babeș Bolyai University, Cluj-Napoca, Romania Delia Rusu, Dunja Mladenić - Jožef Stefan Institute, Ljubljana, Slovenia ailab.ijs.si
  • 3. Overview Terminology Related work Domain Driven Opinion Vocabulary Use Case: Twitter Movie Comments Conclusions and Future Work ailab.ijs.si
  • 4. Terminology opinion - a subjective expression of sentiments, appraisals or feelings opinion words - a set of keywords/phrases used in expressing an opinion orientation of an opinion word indicates whether the opinion expressed is positive, negative or neutral the totality of opinion words forms an opinion vocabulary ailab.ijs.si
  • 5. Related Work – Opinion Vocabulary Dictionary based Esuli and Sebastiani, 2006 - SentiWordNet: three numerical scores (objective, positive, negative) Corpus based Kanayama and Nasukawa, 2006 - context coherency (same polarity tend to appear successively) Jialin Pan et al, 2010 - feature bipartite graph modeling the relationship between domain-specific words and domain- independent words. Dictionary and Corpus based Jijkoun et al, 2010 - dependency parsing on a set of relevant documents, extracting patterns of the form (clue word, syntactic context, target of sentiment) ailab.ijs.si
  • 6. Related Work – Opinion Analysis Approaches: Hatzivassiloglou et al, 1997 – relevance of using connectives (conjunctions: and, or, but, etc) Kim and Hovy, 2004 – use word seed lists and WordNet synsets to determine the strength of the opinion orientation for the identified opinion words Gamon and Aue, 2005 – assign opinion orientation to candidate words, assuming that opinion terms with similar orientation tend to co-occur Twitter Applications: Pang and Lee, 2002 – classifying movie reviews by overall document sentiment Asur and Huberman, 2010 – sentiments extracted from Twitter can be used to build a prediction model for box- office revenue ailab.ijs.si
  • 7. Our Approach Domain-specific IMDb Movie reviews opinion (sample) vocabulary Weird, odd, 2 Clusters bad Weird, odd IMDb Movie reviews amazing, amazing, (Training data) awesome, awesome perfect, fantastic Vocabulary Twitter comments analysis applied to Movie tweets (Test data) ailab.ijs.si
  • 8. Domain Driven Opinion Vocabulary Given a positive word seed list and a negative word seed list, expand the initial seed lists using synonymy Domain-specific / antonymy relations (WordNet) opinion vocabulary the initial words will be assigned a score of 1 for positive words and -1 for negative words Weird, odd, Given a corpus, parse and extract bad 2 Clusters all adjectives and conjunctions – Weird, obtain a graph with two types of odd amazing, relationships between nodes: awesome, amazing, same context orientation (words awesome perfect, connected by and, or, nor) or fantastic opposite context orientation (words connected by but, yet) Clean the resulting set of words and relationship graph by removing stop words and self-reference relations ailab.ijs.si
  • 9. Domain Driven Opinion Vocabulary Some of the characters are fictitious, but not grotesque synonym fictional fictitious real score(“fictitious”) = max(s_syn, s_ant) score(“fictitious”) = s_syn antonym score(“fictitious”) = - 0.3874 if f = 0.9 fictitious but grotesque – ContextOpposite relationship fictitious but not grotesque – ContextSame relationship ailab.ijs.si
  • 10. Domain Driven Opinion Vocabulary 4. Propagate scores through the graph, by determining for each word w Positivity score sPos Negativity score sNeg if relij is a ContextSame relation sPos(wi) += weigth(relij) * prevSPos(wj) sNeg (wi) += weigth(relij) * prevSNeg(wj) else if relij is a ContextOpposite relation sPos(wi) += weigth(relij) * prevSNeg(wj) sNeg (wi) += weigth(relij) * prevSPos(wj) ailab.ijs.si
  • 11. Use Case: Twitter Movie Reviews domain specific document corpus of 27,886 IMDb movie reviews domain specific opinion vocabulary: 9,318 words: 4,925 have a negative orientation and 4393 have a positive orientation Inception (2010) Meet the Spartans (2008) Positive words: good, great, Positive words: funny, awesome, awesome, amazing, favorite, fantastic, great incredible, thrilling, different, speechless Negative words: bad, stupid, dumb, weird, silly, common, ridiculous, Negative words: bad, confusing, terrible weird, stupid, dumb, boring, predictable, horrible, disappointing ailab.ijs.si
  • 12. Use Case: Twitter Movie Reviews 220,387 tweets crawled over a two month interval, keyed on 84 movies Movie Genre Tweets IMDb Our score score Inception (2010) mystery, sci-fi, 19,256 8.9 66.52 thriller Megamind animation, comedy, 8,109 7.3 67.71 (2010) family Unstoppable drama, thriller 15,349 7 63.67 (2010) Burlesque drama, music, 1,244 6.2 70.78 (2010) romance Meet the comedy, war 44 2.5 40.67 Spartans (2008) ailab.ijs.si
  • 13. Twitter comments analysis • Sentiment words distribution for a movie • Sentiment orientation evolution per week, day, hour • Movie comparison ailab.ijs.si
  • 14. Conclusions and Future Work identifying opinion diversity expressed within text, with the aid of a domain-specific vocabulary processing a corpus of IMDb movie reviews, generated an opinion lexicon and analyzed a different opinion source corpus, i.e. a tweet collection further extend our algorithm to include opinion words expressed by verbs and adverbs, as well as more complex expressions conduct experiments in order to determine the correlation between positive opinion words for a given movie and the IMDb movie rating evaluate the opinion lexicon directly ailab.ijs.si
  • 15. Thank You for Your Attention! ailab.ijs.si