SlideShare a Scribd company logo
Xiaowen Ding, Bing Liu and Philip Yu




Presenter: Quang Nguyen
Date: 2010.10.18
Saltlux Vietnam Development Center
   Featured-based Opinion Mining Tasks
    Task 1: Identify and extract object features F that have been
      commented on by an opinion holder (e.g., a reviewer).
    Task 2: Determine whether the opinions on the features F are
      positive, negative or neutral.
    Task 3: Group feature synonyms.
    • Produce a feature-based opinion summary of multiple reviews.


   This paper focuses on Task 2 assuming that features
    have been discovered


                                                                     2
 Opinion Words
  • Positive: beautiful, wonderful, good, amazing,
  • Negative: bad, poor, terrible, cost someone an
    arm and a leg (idiom).

 One effective approach is to use opinion lexicon,
 opinion words.
  • Identify all opinion words in a sentence
  • Aggregate these words to give the final opinion to
    each feature.



                                                         3
 Dictionary-based    approaches
  • Start from a seed opinion words
  • Use Wordnet’s hierarchy and synsets to acquire
   more opinion words
 Corpus-based approaches: extract opinion
 words from large corpora using syntactic
 rules and co-occurrence patterns

  Do not deal well with context dependent
  words!
                                                     4
 Improve lexicon-based approaches using
 context dependent opinion words
  • Negative: “The bedroom is very small”
  • Positive: “The Nokia N3100 is so small as to be
   put in any pockets”
 Propose  a function for aggregating multiple
  opinion words in the same sentence
 Consider explicit and implicit opinions



                                                      5
 Intra-sentence conjunction rule
 Pseudo intra-sentence conjunction
 Inter-sentence conjunction rule




                                      6
 Opinion
        on both sides of “and” should be
 the same
  • E.g., “This camera takes great pictures and has a
   long battery life”.

 Not   likely to say:
  • “This camera takes great pictures and has a short
   battery life.”



                                                        7
 Sometimes,  one may not use an explicit
 conjunction “and”.
  • Same opinion in same sentence, unless there is a
    “but”-like clause
  • E.g., “The camera has a long battery life, which is
    great”




                                                          8
 Peopleusually express the same opinion
 across sentences
  • unless there is an indication of opinion change
    using words such as “but” and “however”
  • E.g., “The picture quality is amazing. The battery life is
   long”
 Not   so natural to say:
  • “The picture quality is amazing. The battery life is
   short”



                                                                 9
 Opinion lexicon is far from sufficient. It needs
 special handling:
  • Negation/But Rule




  • Non-negation contains negative word, e.g., “I like this camera
    not just because it is beautiful”
  • Not contrary, but has a “but”, e.g., ““I not only like the picture
    quality of this camera, but also its size”
  • …



                                                                         10
 Implicit
         Feature is determined through
 adjectives (implicit feature indicator)
  • E.g., “This camera is very small”
  “small” is indicator for “size”
  • E.g., “This camera is very heavy”
  • “heavy” is indicator for “weight”




                                           11
 An object O is an entity which can be a product,
  person, event, organization, or topic
 An object O is represented with a finite set of features,
  F = {f1, f2, …, fn}.
    • Each feature fi in F can be expressed with a finite set of words
      or phrases Wi, which are synonyms.

   Model of a review: An opinion holder j comments on a
    subset of the features Sj F of object O.
    • For each feature fk   Sj that j comments on, he/she
       chooses a word or phrase from Wk to describe the
        feature, and
       expresses a positive, negative or neutral opinion on fk.

                                                                         12
   Input: a pair (f, s), where f is a product feature and s is a
    sentence that contains f.
   Output: whether the opinion on f in s is pos, neg, or neut.



                 wi: opinion word
                 V: set of all opinion words
                 dis(wi, f): distance between wi and f
                 SO: semantic orientation of wi (+1, -1, 0)




                                                                    13
14
15
Precision Recall F-Score
FBS
(M. Hu and B. Liu. Mining and         0.93    0.76    0.83
summarizing customer
reviews. KDD’04, 2004)

OPINE
(A-M. Popescu and O. Etzioni.
Extracting Product Features
                                      0.86    0.89    0.87
and Opinions from Reviews. EMNLP-
05, 2005.)
Opinion Observer                      0.92    0.91    0.91
(this paper)




                                                               16
   Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic
    Lexicon-Based Approach to Opinion Mining, Proceedings
    of the international conference on Web search and web
    data mining, USA, 2008




                                                            17
18

More Related Content

Similar to A holistic lexicon based approach to opinion mining

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1
Eddie Hollon
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2ddelargy
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assess
iain bruce
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5Dave Eddy
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2
Machel Sweeney
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyDr Nur Suhaili Ramli
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)
Kira
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5
kimpalmore
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringCS, NcState
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWholeeducation
 
Bagwords
BagwordsBagwords
Bagwords
mustafa sarac
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012Regina Navejar
 
Assignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransmanAssignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransman
Nelson Mandela Metropolitan University
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion mining
Sumit Sony
 
Reading Street
Reading StreetReading Street
Reading Street
cavalcic
 
Reading Street
Reading StreetReading Street
Reading Street
cavalcic
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summary
jschultz
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012Regina Navejar
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviews
firzhan naqash
 

Similar to A holistic lexicon based approach to opinion mining (20)

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assess
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research Methodology
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software Engineering
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacy
 
Bagwords
BagwordsBagwords
Bagwords
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012
 
Assignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransmanAssignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransman
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion mining
 
Reading Street
Reading StreetReading Street
Reading Street
 
Reading Street
Reading StreetReading Street
Reading Street
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summary
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviews
 

More from Nguyen Quang

Apache Zookeeper
Apache ZookeeperApache Zookeeper
Apache Zookeeper
Nguyen Quang
 
Apache Storm
Apache StormApache Storm
Apache Storm
Nguyen Quang
 
Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement Learning
Nguyen Quang
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
Nguyen Quang
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Nguyen Quang
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandra
Nguyen Quang
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architecture
Nguyen Quang
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginnerNguyen Quang
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrumNguyen Quang
 
Text categorization
Text categorizationText categorization
Text categorization
Nguyen Quang
 
Overview of NoSQL
Overview of NoSQLOverview of NoSQL
Overview of NoSQL
Nguyen Quang
 

More from Nguyen Quang (13)

Apache Zookeeper
Apache ZookeeperApache Zookeeper
Apache Zookeeper
 
Apache Storm
Apache StormApache Storm
Apache Storm
 
Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement Learning
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandra
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architecture
 
Eclipse orion
Eclipse orionEclipse orion
Eclipse orion
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginner
 
Html 5
Html 5Html 5
Html 5
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrum
 
Text categorization
Text categorizationText categorization
Text categorization
 
Overview of NoSQL
Overview of NoSQLOverview of NoSQL
Overview of NoSQL
 

Recently uploaded

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
Globus
 

Recently uploaded (20)

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
 

A holistic lexicon based approach to opinion mining

  • 1. Xiaowen Ding, Bing Liu and Philip Yu Presenter: Quang Nguyen Date: 2010.10.18 Saltlux Vietnam Development Center
  • 2. Featured-based Opinion Mining Tasks Task 1: Identify and extract object features F that have been commented on by an opinion holder (e.g., a reviewer). Task 2: Determine whether the opinions on the features F are positive, negative or neutral. Task 3: Group feature synonyms. • Produce a feature-based opinion summary of multiple reviews.  This paper focuses on Task 2 assuming that features have been discovered 2
  • 3.  Opinion Words • Positive: beautiful, wonderful, good, amazing, • Negative: bad, poor, terrible, cost someone an arm and a leg (idiom).  One effective approach is to use opinion lexicon, opinion words. • Identify all opinion words in a sentence • Aggregate these words to give the final opinion to each feature. 3
  • 4.  Dictionary-based approaches • Start from a seed opinion words • Use Wordnet’s hierarchy and synsets to acquire more opinion words  Corpus-based approaches: extract opinion words from large corpora using syntactic rules and co-occurrence patterns Do not deal well with context dependent words! 4
  • 5.  Improve lexicon-based approaches using context dependent opinion words • Negative: “The bedroom is very small” • Positive: “The Nokia N3100 is so small as to be put in any pockets”  Propose a function for aggregating multiple opinion words in the same sentence  Consider explicit and implicit opinions 5
  • 6.  Intra-sentence conjunction rule  Pseudo intra-sentence conjunction  Inter-sentence conjunction rule 6
  • 7.  Opinion on both sides of “and” should be the same • E.g., “This camera takes great pictures and has a long battery life”.  Not likely to say: • “This camera takes great pictures and has a short battery life.” 7
  • 8.  Sometimes, one may not use an explicit conjunction “and”. • Same opinion in same sentence, unless there is a “but”-like clause • E.g., “The camera has a long battery life, which is great” 8
  • 9.  Peopleusually express the same opinion across sentences • unless there is an indication of opinion change using words such as “but” and “however” • E.g., “The picture quality is amazing. The battery life is long”  Not so natural to say: • “The picture quality is amazing. The battery life is short” 9
  • 10.  Opinion lexicon is far from sufficient. It needs special handling: • Negation/But Rule • Non-negation contains negative word, e.g., “I like this camera not just because it is beautiful” • Not contrary, but has a “but”, e.g., ““I not only like the picture quality of this camera, but also its size” • … 10
  • 11.  Implicit Feature is determined through adjectives (implicit feature indicator) • E.g., “This camera is very small” “small” is indicator for “size” • E.g., “This camera is very heavy” • “heavy” is indicator for “weight” 11
  • 12.  An object O is an entity which can be a product, person, event, organization, or topic  An object O is represented with a finite set of features, F = {f1, f2, …, fn}. • Each feature fi in F can be expressed with a finite set of words or phrases Wi, which are synonyms.  Model of a review: An opinion holder j comments on a subset of the features Sj F of object O. • For each feature fk Sj that j comments on, he/she  chooses a word or phrase from Wk to describe the feature, and  expresses a positive, negative or neutral opinion on fk. 12
  • 13. Input: a pair (f, s), where f is a product feature and s is a sentence that contains f.  Output: whether the opinion on f in s is pos, neg, or neut. wi: opinion word V: set of all opinion words dis(wi, f): distance between wi and f SO: semantic orientation of wi (+1, -1, 0) 13
  • 14. 14
  • 15. 15
  • 16. Precision Recall F-Score FBS (M. Hu and B. Liu. Mining and 0.93 0.76 0.83 summarizing customer reviews. KDD’04, 2004) OPINE (A-M. Popescu and O. Etzioni. Extracting Product Features 0.86 0.89 0.87 and Opinions from Reviews. EMNLP- 05, 2005.) Opinion Observer 0.92 0.91 0.91 (this paper) 16
  • 17. Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic Lexicon-Based Approach to Opinion Mining, Proceedings of the international conference on Web search and web data mining, USA, 2008 17
  • 18. 18