Review of Natural Language Processing tasks and examples of why it is so hard. Then he describes in detail text categorization and particularly sentiment analysis. A few common approaches for predicting sentiment are discussed, going even further, explaining statistical machine learning algorithms.
This Project Aimed at doing a comprehensive study of Different Machine Learning Approaches on Sentiment Analysis of Movie Reviews. Support Vector Machines were the one that Performed Most Accurately with Radial Basis Function. Lots of Other kernel functions and Kernel Parameters were tried to find the optimal one. We achieved accuracy up to 83%.
Review of Natural Language Processing tasks and examples of why it is so hard. Then he describes in detail text categorization and particularly sentiment analysis. A few common approaches for predicting sentiment are discussed, going even further, explaining statistical machine learning algorithms.
This Project Aimed at doing a comprehensive study of Different Machine Learning Approaches on Sentiment Analysis of Movie Reviews. Support Vector Machines were the one that Performed Most Accurately with Radial Basis Function. Lots of Other kernel functions and Kernel Parameters were tried to find the optimal one. We achieved accuracy up to 83%.
Sentiment analysis - Our approach and use casesKarol Chlasta
I. Introduction to Sentiment Analysis and its applications.
II. How to approach Sentiment Analysis?
III. 2015 Elections in Poland on Twitter.com & Onet.pl.
It gives an overview of Sentiment Analysis, Natural Language Processing, Phases of Sentiment Analysis using NLP, brief idea of Machine Learning, Textblob API and related topics.
Deep Learning approaches for Hate speech detection. In this work we used the two deep learning approaches DCNN and MLP two separate classifier on four publicly available datasets.
Sentiment Analysis/Opinion Mining of Twitter Data on Unigram/Bigram/Unigram+Bigram Model using:
1. Machine Learning
2. Lexical Scores
3. Emoticon Scores
YouTube Video: https://youtu.be/VuR16P87yPE
Link to the WebPage: http://akirato.github.io/Twitter-Sentiment-Analysis-Tool
Github Page: https://github.com/Akirato/Twitter-Sentiment-Analysis-Tool
Sentiment analysis and opinion mining is almost same thing however there is minor difference between them that is opinion mining extracts and analyze people's opinion about an entity while Sentiment analysis search for the sentiment words/expression in a text and then analyze it.
It uses machine learning techniques like SVM (Support Vector Machines) to analyze the text and classify them as positive, negative or neutral.
Sentiment analysis - Our approach and use casesKarol Chlasta
I. Introduction to Sentiment Analysis and its applications.
II. How to approach Sentiment Analysis?
III. 2015 Elections in Poland on Twitter.com & Onet.pl.
It gives an overview of Sentiment Analysis, Natural Language Processing, Phases of Sentiment Analysis using NLP, brief idea of Machine Learning, Textblob API and related topics.
Deep Learning approaches for Hate speech detection. In this work we used the two deep learning approaches DCNN and MLP two separate classifier on four publicly available datasets.
Sentiment Analysis/Opinion Mining of Twitter Data on Unigram/Bigram/Unigram+Bigram Model using:
1. Machine Learning
2. Lexical Scores
3. Emoticon Scores
YouTube Video: https://youtu.be/VuR16P87yPE
Link to the WebPage: http://akirato.github.io/Twitter-Sentiment-Analysis-Tool
Github Page: https://github.com/Akirato/Twitter-Sentiment-Analysis-Tool
Sentiment analysis and opinion mining is almost same thing however there is minor difference between them that is opinion mining extracts and analyze people's opinion about an entity while Sentiment analysis search for the sentiment words/expression in a text and then analyze it.
It uses machine learning techniques like SVM (Support Vector Machines) to analyze the text and classify them as positive, negative or neutral.
Sentiment analysis, also known as opinion mining, is a field of computer science that focuses on automatically identifying the opinions and feelings expressed in text, audio and video. It aims to determine whether a document expresses a subjective view (positive, negative, or neutral) or presents objective facts.
Sentiment analysis involves determining the sentiment expressed by a writer in a document. The objective of the opinion-mining field is to conduct subjectivity analysis, indicating whether a document is subjective or objective. Subjectivity implies the presence of sentiment, while objectivity signifies content devoid of sentiment. Currently, an abundance of information about a specific product is available, with a single product often garnering hundreds of reviews across various webpages. Numerous websites, such as imdb.com, amazon.com, idlebrain.com, among others, aggregate user information and expert opinions to publish reviews. Experts meticulously analyze reviews, extract opinions, and generate ratings related to the dataset provided by the requesting agencies. However, handling the vast amount of data is a labor-intensive task for experts. The continuously growing volume of web data poses challenges in extracting precise opinions from content. Hence, there is a need to design a system that can efficiently perform these tasks with human-like accuracy.
In this research work, the propose approach enough capable of handling and analyzing large amounts of reviews. The reviews considered of analyzing are pre-analyzed with existing algorithms and further processed through the approach proposed in the present research work. The working capacity of the proposed approach extracts sentiment from the available content (dataset) and determines polarity degree using sentiment polarity and degree management. It also measures sentiment degrees based on user-provided target document features. The outcome is a summary comprising highly sentiment-related sentences, providing valuable insights to the users. The goal is to streamline sentiment analysis processes and enhance accuracy in a manner that aligns with human-like comprehension.
Sentiment analysis, also known as opinion mining, is a field of computer science that focuses on automatically identifying the opinions and feelings expressed in text, audio and video. It aims to determine whether a document expresses a subjective view (positive, negative, or neutral) or presents objective facts.
Sentiment analysis involves determining the sentiment expressed by a writer in a document. The objective of the opinion-mining field is to conduct subjectivity analysis, indicating whether a document is subjective or objective. Subjectivity implies the presence of sentiment, while objectivity signifies content devoid of sentiment. Currently, an abundance of information about a specific product is available, with a single product often garnering hundreds of reviews across various webpages. Numerous websites, such as imdb.com, amazon.com, idlebrain.com, among others, aggregate user information and expert opinions to publish reviews. Experts meticulously analyze reviews, extract opinions, and generate ratings related to the dataset provided by the requesting agencies. However, handling the vast amount of data is a labor-intensive task for experts. The continuously growing volume of web data poses challenges in extracting precise opinions from content. Hence, there is a need to design a system that can efficiently perform these tasks with human-like accuracy.
In this research work, the propose approach enough capable of handling and analyzing large amounts of reviews. The reviews considered of analyzing are pre-analyzed with existing algorithms and further processed through the approach proposed in the present research work. The working capacity of the proposed approach extracts sentiment from the available content (dataset) and determines polarity degree using sentiment polarity and degree management. It also measures sentiment degrees based on user-provided target document features. The outcome is a summary comprising highly sentiment-related sentences, providing valuable insights to the users. The goal is to streamline sentiment analysis processes and enhance accuracy in a manner that aligns with human-like comprehension.
Business intelligence analytics using sentiment analysis-a surveyIJECEIAES
Sentiment analysis (SA) is the study and analysis of sentiments, appraisals and impressions by people about entities, person, happening, topics and services. SA uses text analysis techniques and natural language processing methods to locate and extract information from big data. As most of the people are networked themselves through social websites, they use to express their sentiments through these websites.These sentiments are proved fruitful to an individual, business, government for making decisions. The impressions posted on different available sources are being used by organization to know the market mood about the services they are providing. Analyzing huge moods expressed with different features, style have raised challenge for users. This paper focuses on understanding the fundamentals of sentiment analysis, the techniques used for sentiment extraction and analysis. These techniques are then compared for accuracy, advantages and limitations. Based on the accuracy for expexted approach, we may use the suitable technique.
Multiple Methods and Techniques in Analyzing Computer-Supported Collaborative...CITE
5 March 2010 (Friday) | 09:00 - 12:30 | http://citers2010.cite.hku.hk/abstract/69 | Dr. Kwok Ping CHAN, Associate Professor, Department of Computer Science, HKU
Aspect Level Sentiment Analysis for Arabic LanguageMido Razaz
This is the presentation I used in my proposal seminar for master degree in ISSR.
the thesis about Aspect Level Sentiment Classification for Arabic Language.
Any further info. please contact me at (razaz_2006@hotmail.com)
One fundamental problem in sentiment analysis is categorization of sentiment polarity. Given a piece of written text, the problem is to categorize the text into one specific sentiment polarity, positive or negative (or neutral). Based on the scope of the text, there are three distinctions of sentiment polarity categorization, namely the document level, the sentence level, and the entity and aspect level. Consider a review “I like multimedia features but the battery life sucks.†This sentence has a mixed emotion. The emotion regarding multimedia is positive whereas that regarding battery life is negative. Hence, it is required to extract only those opinions relevant to a particular feature (like battery life or multimedia) and classify them, instead of taking the complete sentence and the overall sentiment. In this paper, we present a novel approach to identify pattern specific expressions of opinion in text.
Essay On Sentiment Analysis
Billy Collins Introduction To Poetry Analysis
Team Reflective Analysis
Marketing Analysis Essay
Introduction of Toyota Motor Corporation
Situational Analysis Essay
Examples Of Semiotic Analysis
Poetry Analysis of "Introduction to Poetry"
Group Analysis Essay
Introductory Paragraph Analysis
Environmental Analysis Essay
Essay Personal Swot
Essay on Self-Analysis
Introduction To Poetry Billy Collins Analysis
Describe A Place Analysis
Organizational Analysis Essay
Billy Collins Introduction To Poetry Analysis
Basic introduction of opinion mining and sentiment analysis , challenges faced during opinion mining, Sentiment classification at various levels and application of mining and sentiment analysis
Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu
For massive graphs that fit in RAM, but not in GPU memory, it is possible to take
advantage of a shared memory system with multiple CPUs, each with multiple cores, to
accelerate pagerank computation. If the NUMA architecture of the system is properly taken
into account with good vertex partitioning, the speedup can be significant. To take steps in
this direction, experiments are conducted to implement pagerank in OpenMP using two
different approaches, uniform and hybrid. The uniform approach runs all primitives required
for pagerank in OpenMP mode (with multiple threads). On the other hand, the hybrid
approach runs certain primitives in sequential mode (i.e., sumAt, multiply).
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...2023240532
Quantitative data Analysis
Overview
Reliability Analysis (Cronbach Alpha)
Common Method Bias (Harman Single Factor Test)
Frequency Analysis (Demographic)
Descriptive Analysis
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects to see demand and the changing evolution of supply, facilitated through institutional investment rotation out of offices and into work from home (“WFH”), while the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
3. What is Sentiment Analysis?
● Sentiment analysis is the field of study that analyzes people’s opinions,
sentiments, evaluations, appraisals, attitudes, and emotions towards
entities such as products, services, organizations, individuals, issues, events,
topics, and their attributes.
● Also known as Opinion Mining, Opinion Extraction, Sentiment Mining,
Subjectivity Analysis, Emotion Analysis.
4. Sentiment Analysis~NLP
❖ Sentiment analysis is a highly restricted NLP problem because the system
does not need to fully understand the semantics of each document but only
needs to understand some aspects of it, i.e., positive or negative sentiments
and their target entities or topics.
5. Sentiment Analysis Applications
❖ No longer be necessary to conduct surveys, opinion polls, and focus groups
in order to gather public opinions.
❖ Customer feedback collected from emails and call centers or results from
surveys conducted by the organizations.
6. Different Levels of Analysis
❖ Document level
❖ Sentence level
❖ Entity and Aspect level
7. Document Level
❖ The task at this level is to classify whether a whole opinion document
expresses a positive or negative sentiment.
❖ For example, given a product review, the system determines whether the
review expresses an overall positive or negative opinion about the product.
8. Sentence Level
❖ Determines whether each sentence expressed a positive, negative, or neutral
opinion (Neutral usually means no opinion).
❖ Example, “I love OnePlus 7T.” - positive sentiment and “I didn’t like the
packaging of device.” - negative sentiment.
9. Entity and Aspect Level
❖ It is based on the idea that an opinion consists of a sentiment (positive or
negative) and a target (of opinion).
❖ For Example, “The iPhone’s call quality is good, but its battery life is short.”
➢ Aspect: call quality and battery life, Entity: iPhone.
➢ The sentiment on iPhone’s call quality is positive, but the sentiment on its battery life is
negative.
➢ The call quality and battery life of iPhone are the opinion targets.
10. Sentiment Lexicon
❖ The most important indicators of sentiments are sentiment words,
➢ Example, good, wonderful, and amazing are positive sentiment words and bad, poor, and
terrible are negative sentiment words.
❖ Apart from individual words, there are also phrases and idioms,
➢ Example, cost someone an arm and a leg.
❖ A list of such words and phrases is called a sentiment lexicon.
❖ Sentiment lexicon is necessary but not sufficient for sentiment analysis.
11. Sentiment Lexicon- Issue 1
❖ A +ive or -ive sentiment word may have opposite orientations in different
application domains.
❖ Example, “suck” usually indicates negative sentiment,but
“This camera sucks.”
“This vacuum cleaner really sucks.”
12. Sentiment Lexicon- Issue 2
❖ A sentence containing sentiment words may not express any sentiment.
❖ Question (interrogative) sentences and conditional sentences are two
important types,
❖ Example,
“Can you tell me which Sony camera is good ?”
“If I can find a good camera in the shop, I will buy it.”
13. Sentiment Lexicon- Issue 3
❖ Sarcastic sentences with or without sentiment words are hard to deal with.
❖ Sarcasms are not so common in consumer reviews about products and
services, but are very common in political discussions, which make political
opinions hard to deal with.
❖ Example,
“What a great car! It stopped working in two days.”
14. Sentiment Lexicon- Issue 4
❖ Many sentences without sentiment words can also imply opinions.
❖ Many of these sentences are actually objective sentences that are used to
express some factual information.
❖ Example,
“This washer uses a lot of water.”
expresses a negative sentiment about the washer since it uses a lot of resource (water)
“After sleeping on the mattress for two days, a valley has formed in the middle.”
expresses a negative opinion about the mattress.
15. Problem Definition
❖ We use the following review about a Canon camera to introduce the problem.
❖ From this review, we notice a few important points.
16. Analysis of Review..
The review has a number of opinions, both positive and negative, about Canon
G12 camera.
❖ Sentence (2) - a positive opinion about the Canon camera as a whole.
❖ Sentence (3) - a positive opinion about its picture equality.
❖ Sentence (4) - a positive opinion about its battery life.
❖ Sentence (5) - a negative opinion about the weight of the camera.
17. Analysis of Review(Definition of Opinion as Tuple)..
Definition: An opinion consists of two key components:
( g, s)
where
g - target (Any entity about which an opinion has been expressed)
s - sentiment, (strength/intensity or type of the sentiment.)
Example, the target of the opinion in sentence (2) - Canon G12.
18. Analysis of Review(Issues with this Definition)..
● The holder of the opinions in sentences (2), (3), and (4) is the author of the
review (“ John Smith” ), but for sentence (5), it is the wife of the author.
● The date of the review is September 10, 2011. This date is important in
practice because one often wants to know how opinions change with time
and opinion trends.
19. Analysis of Review(Opinion as a Quadruple)..
Definition: An opinion is a quadruple,
( g, s, h, t)
where,
g - opinion (or sentiment) target,
s - sentiment about the target,
h - opinion holder
t - time when the opinion was expressed.
20. Analysis of Review(Improvement)..
❖ Observation: In sentence (3), the opinion target is actually “picture quality of
Canon G12,” but the sentence mentioned only “picture quality.”
❖ Solution: The target can often be decomposed and described in a structured
manner with multiple levels, which greatly facilitate both mining of opinions
and later use of the mined opinion results.
❖ For example, “picture quality of Canon G12” ~ (Canon-G12, picture-quality).
21. Analysis of Review(Entity)..
Definition: An entity e is a product, service, topic, issue, person, organization, or
event. It is described with a pair,
e: (T, W)
where,
T - a hierarchy of parts, sub-parts, and so on,
W - a set of attributes of e.
Example, For a Canon G12. It has a set of attributes, e.g., picture quality, size, and
weight, and a set of parts, e.g., lens, viewfinder, and battery.
22. Analysis of Review(Improvement)..
❖ Observation: Sentence (2) expresses a positive opinion about the entity
Canon G12 camera as a whole.
Sentence (3) expresses a positive opinion on the attribute of picture quality of
the camera. Clearly, one can also express opinions about parts or
components of the camera.
❖ Solution: We simplify the hierarchy to two levels and use the term aspects to
denote both parts and attributes. The root node is still the entity itself, but
the second level (also the leaf level ) nodes are different aspects of the entity.
23. Analysis of Review(Quintuple Definition of Opinion)..
Definition: An opinion is a quintuple,
(ei
, aij
, sijkl
, hk
, tl
)
where
ei
is the name of an entity,
aij
is an aspect of ei
,
sijkl
is the sentiment on aspect aij
of entity ei
hk
is the opinion holder,
tl
is the time when the opinion is expressed by hk
.
24. Quintuple Definition of Opinion(Important Points)
❖ We purposely use subscripts. The opinion sijkl
must be given by opinion
holder hk
about aspect aij
of entity ei
at time tl
.
❖ The opinion defined here is just one type of opinion, called regular opinion.
Another type is comparative opinion, which needs a different definition.
❖ This definition provides a framework to transform unstructured text to
structured data.
❖ The definition covers most but not all possible facets of the semantic
meaning of an opinion, which can be arbitrarily complex. For example, in
situation “The viewfinder and the lens are too close.” which expresses an
opinion but does not cover the context of the opinion.
25. Objective of Sentiment Analysis
Given an opinion document d, discover all opinion quintuples (ei
, aij
, sijkl
, hk
, tl
)
in d.
26. Problem in Extraction of Quintuple
● For example, Motorola may be written as Mot, Moto, and Motorola. We need
to recognize that they all refer to the same entity.
Definition: An entity category represents a unique entity, while an entity
expression is an actual word or phrase that appears in the text indicating an
entity category.
27. Problem in Extraction of Quintuple
● For example, picture, image, and photo are the same aspect for cameras. We
thus need to extract aspect expressions and categorize them.
Definition: An aspect category of an entity represents a unique aspect of the
entity, while an aspect expression is an actual word or phrase that appears in
the text indicating an aspect category.
28. Aspect Expressions
Aspect expressions are usually nouns and noun phrases but can also be verbs,
verb phrases, adjectives, and adverbs.
❖ Explicit Aspect Expression: Aspect expressions that are nouns and noun
phrases
➢ Example, “picture quality” in “The picture quality of this camera is great ”
❖ Implicit Aspect Expression: Aspect expressions that are not nouns or noun
phrases
➢ Example, “expensive” is an implicit aspect expression in “This camera is expensive.”
29. Summarization
Task 1 (entity extraction and categorization): Extract all entity expressions in D,
and categorize or group synonymous entity expressions into entity clusters (or
categories). Each entity expression cluster indicates a unique entity ei
.
Task 2 (aspect extraction and categorization): Extract all aspect expressions of
the entities, and categorize these aspect expressions into clusters. Each aspect
expression cluster of entity ei
represents a unique aspect aij
.
Task 3 (opinion holder extraction and categorization): Extract opinion holders
for opinions from text or structured data and categorize them. The task is
analogous to the above two tasks.
30. Summerization(Cont.)..
Task 4 (time extraction and standardization): Extract the times when opinions
are given and standardize different time formats. The task is also analogous to
the above tasks.
Task 5 (aspect sentiment classification): Determine whether an opinion on an
aspect aij
is positive, negative or neutral, or assign a numeric sentiment rating to
the aspect.
Task 6 (opinion quintuple generation): Produce all opinion quintuples (ei
, aij
, sijkl
,
hk
, tl
) expressed in document d based on the results of the above tasks.
32. Solution(1)..
1. Task 1 should extract the entity expressions, “Samsung,” “Samy,” and
“Canon,” and group “Samsung” and “Samy” together as they represent the
same entity.
2. Task 2 should extract aspect expressions “picture,” “photo,” and “battery
life,” and group “picture” and “photo” together as for cameras they are
synonyms.
3. Task 3 should find the holder of the opinions in sentence (3) to be big John
(the blog author) and the holder of the opinions in sentence (4) to be big
John’s friend.
33. Solution(2)..
● Task 4 should also find the time when the blog was posted is Sept-15-2011.
● Task 5 should find that sentence (3) gives a negative opinion to the picture
quality of the Samsung camera and also a negative opinion to its battery life.
Sentence (4) gives a positive opinion to the Canon camera as a whole and also
to its picture quality. Sentence (5) seemingly expresses a positive opinion, but
it does not.
34. Solution(3)..
(Samsung, picture_quality, negative, big John, Sept-15-2011)
(Samsung, battery_life, negative, big John, Sept-15-2011)
(Canon, GENERAL, positive, big John’s_friend, Sept-15-2011)
(Canon, picture_quality, positive, big John’s_friend, Sept-15-2011)
35. Types of Opinions - Regular & Comparative Opinions
❖ Regular Opinions: A sentiment only on an particular entity or an aspect of
the entity,
➢ For Example, “Coke tastes very good.” which expresses a positive sentiment on the aspect
taste of Coke.
❖ Comparative Opinions: A comparative opinion compares multiple entities
based on some of their shared aspects,
➢ For Example, “Coke tastes better than Pepsi.” which compares Coke and Pepsi based on their
tastes (an aspect) and expresses a preference for Coke.
36. Type of Regular Opinions- Regular & Indirect Opinion
❖ Direct opinion: A direct opinion refers to an opinion expressed directly on
an entity or an entity aspect. Example, “The picture quality is great.”
❖ Indirect opinion: An indirect opinion is an opinion that is expressed
indirectly on an entity or aspect of an entity based on its effects on some
other entities.
➢ Example, the sentence “After injection of the drug, my joints felt worse” describes an
undesirable effect of the drug on “my joints”, which indirectly gives a negative opinion or
sentiment to the drug.
37. Type of Opinions- Explicit and Implicit Opinions
❖ Explicit opinion: An explicit opinion is a subjective statement that gives a
regular or comparative opinion.
➢ Example, “Coke tastes great,” and “Coke tastes better than Pepsi.”
❖ Implicit (or implied) opinion: An implicit opinion is an objective statement
that implies a regular or comparative opinion.
➢ Example, “I bought the mattress a week ago, and a valley has formed,” and “The battery life of
Nokia phones is longer than Samsung phones.”
Explicit opinions are easier to detect and to classify than implicit opinions.
38. Subjectivity Classification
❖ Definition: The task of determining whether a sentence is subjective or
objective.
❖ An objective sentence presents some factual information about the world,
while a subjective sentence expresses some personal feelings, views, or beliefs.
➢ Example, objective sentence is “iPhone is an Apple product.” and subjective sentence is “I like
iPhone.”
39. Subjectivity and Opinionated
❖ Subjectivity totally unrelated to being opinionated. By opinionated, we mean
that a document or sentence expresses or implies a positive or negative
sentiment.
❖ A subjective sentence may not express any sentiment.
➢ Example, “I think that he went home” is a subjective sentence, but does not express any
sentiment.
❖ Objective sentences can imply opinions or sentiments due to desirable and
undesirable facts.
➢ Example, “The earphone broke in two days.” and “I brought the mattress a week ago and a
valley has form.”
40. Emotions
❖ Definition: Emotions are our subjective feelings and thoughts.
❖ The strength of a sentiment or opinion is typically linked to the intensity of
certain emotions, e.g., joy and anger.
❖ Opinions that we study in sentiment analysis are mostly evaluations.
❖ According to consumer behavior research, evaluations can be broadly
categorized into two types: rational evaluations and emotional evaluations.
41. Rational Evaluation & Emotional Evaluation
❖ Definition (Rational Evaluation): Such evaluations are from rational
reasoning, tangible beliefs, and utilitarian attitudes.
➢ Example, “The voice of this phone is clear,” “This car is worth the price,”
❖ Definition(Emotional Evaluation): Such evaluations are from non-tangible
and emotional responses to entities which go deep into people’s state of
mind.
➢ Example, “I love iPhone,” “I am so angry with their service people”
42. Document Sentiment Classification
Given an opinion document d evaluating an entity, determine the overall sentiment s
of the opinion holder about the entity, i.e., determine s expressed on aspect GENERAL
in the quintuple
(_, GENERAL, s, _, _ ),
where the entity e, opinion holder h, and time of opinion t are assumed known or
irrelevant (do not care).
Assumption: Sentiment classification or regression assumes that the opinion document d (e.g., a product review)
expresses opinions on a single entity e and contains opinions from a single opinion holder h.
43. Sentiment Classification Using Supervised Learning
❖ Like other supervised machine learning applications, the key for sentiment
classification is the engineering of a set of effective features.
❖ Training and testing data used are normally product reviews.
❖ Some of the example features are the following:
➢ Terms and their frequency
➢ Part of speech.
➢ Sentiment words and phrases
➢ Sentiment shifters
44. Terms and their Frequency
❖ These features are individual words (unigram) and their n-grams with
associated frequency counts.
❖ In some cases, word positions may also be considered.
❖ The TF-IDF Weighting Scheme (tf-idf is a weighting scheme that assigns each term in a
document a weight based on its term frequency (tf) and inverse document frequency (idf). The terms
with higher weight scores are considered to be more important. ) from information retrieval
may be applied too.
45. Part of Speech
❖ Words of different parts of speech (POS) may be treated differently.
❖ For example, it was shown that adjectives are important indicators of
opinions.
❖ However, one can also use all POS tags and their n-grams as features.
46.
47. Sentiment words and Phrases
❖ Sentiment words are words in a language that are used to express positive or
negative sentiments.
❖ For example, good, wonderful, and amazing are positive sentiment words,
and bad, poor, and terrible are negative sentiment words.
❖ Apart from individual words, there are also sentiment phrases and idioms,
e.g., cost someone an arm and a leg.
48. Sentiment Shifter
❖ These are expressions that are used to change the sentiment orientations,
e.g., from positive to negative or vice versa.
❖ Negation words are the most important class of sentiment shifters. Example,
the sentence “I don’t like this camera ” is negative.
❖ Suchshifters also need to be handled with care because not all occurrences
of such words mean sentiment changes. For example, “not” in “not only . . .
but also” does not change sentiment orientation.
49. Sentiment Analysis Using Unsupervised Learning
❖ Classification based on some fixed syntactic patterns that are likely to be
used to express opinions.
❖ The syntactic patterns are composed based on part-of-speech (POS)
tags.The algorithm given in Turney (2002) consists of three steps.
50. Step -1
❖ Two consecutive words are extracted if their POS tags conform to any of the
patterns. See Table.The nouns or verbs act as the contexts because in
different contexts a JJ, RB, RBR and RBS word may express different
sentiments.
❖ As an example, in the sentence “This piano produces beautiful sounds
”,“beautiful sounds ” is extracted as it satisfies the first pattern.
Note: The adjective (JJ ) “unpredictable” may have a negative sentiment in a car review as in “unpredictable steering,” but
it could have a positive sentiment in a movie review as in “unpredictable plot.”
51.
52. Step 2
❖ It estimates the sentiment orientation (SO) of the extracted phrases using the
pointwise mutual information ( PMI ) measure:
Pr(term 1 U term 2) is the actual co-occurrence probability of term 1 and term 2 ,
Pr ( term 1 )Pr ( term 2 ) is the co-occurrence probability of the two terms
❖ PMI measures the degree of statistical dependence between two terms.
53. Step 2(Cont..)
❖ The sentiment orientation (SO ) of a phrase is computed based on its
association with the positive reference word “excellent” and the negative
reference word “poor ”:
SO(phrase) = PMI (phrase, “excellent”) - PMI(phrase, “poor”)
54. Step 3
Given a review, the algorithm computes the average SO of all phrases in the
review and classifies the review as positive if the average SO is positive and
negative otherwise.