SlideShare a Scribd company logo
1 of 72
Sentiment Analysis in
                     Social Media
                    How and Why




Davide Feltoni Gurini
                                                   1
Sistemi Intelligenti per Internet A.A. 2012/2013
About me and Contacts




1Β° Year Ph.D. in Computer Engineering at Roma Tre University
Via della Vasca Navale 79, Rome
A.I. Lab. Room 2.02a
Contacts:
       β€’ feltoni@dia.uniroma3.it
       β€’ http://about.me/davidefeltoni
                                                               2
Outline

οƒΌ Social Networks and Web 2.0
β€’ Sentiment analysis: what is it?
β€’ Sentiment analysis: applications
β€’ Sentiment analysis: an inside look
β€’ TwitterSA
β€’ TwitterSA Soccer Match Analysis
β€’ TwitterSA Predicting Elections



                                       3
Social Networks and Web 2.0
    Common Uses:
β€’    Spend time on internet!
β€’    Virtual network of friends
β€’    Gather info about something
β€’    Share people what we do
β€’    Know what other people do
β€’    Know what is happening in the world
β€’    …




                                           4
Social Networks and Web 2.0

Behind the scenes
β€’ Data Analyst
   β€’ Statistical studies
   β€’ How to collect and store big
     data
β€’ Business Analyst
   β€’ Social marketing
   β€’ Web Advertising
β€’ Web and App Developer
   β€’ Integrate social in games and
     apps
β€’ Ph.D. and Researcher
   β€’ Research methods and publish
     articles                        5
Social Networks and Web 2.0

    Evolution of Web

β€’ 1.0 (web for experts) 2.0 (web for
  everyone)
β€’ Users are not only reading
β€’ Web is a data container
β€’ Users now generate more content
    β€’    Blog, Social networks, write
         reviews, …
    β€’ Collaborative sites
       (Wikipedia, Communities,
       Forum)




                                        6
Social Networks: Overview of Italy




                                               7

Source: nielsen 2012
Social Networks: Some Statistics

                 Facebook monthly active users now total nearly 850 million



                      People spent about 11 hour at month considering only Facebook



                         There are 175 million tweets sent from Twitter every day in 2012



                         The 2012 election broke records with 31.7 million political tweets.



                      625,000 new users on Google+ every day


                 27% of small and 34% of medium businesses are using social media for business
                 (+20% YoY)

                                                                                                 8
Source: www.huffingtonpost.com
Social Networks and Web 2.0: Big Audience, Big Data



                           β€’ Even older people use social networks
                           β€’ Important for web data analysis




                                                              9
Social Networks: Real Time Update
                2012 Greece clashes
      β€’ Fresh news: search engine vs social network




                                                      10
Social Networks: Simultaneous Use




                                            11

Source: nielsen 2012
Social Networks and Web 2.0: Why Are So
              Important?

β€’ Know what most of people think in Web
β€’ Big data available almost free
β€’ Great audience and big slice of the real
  population
β€’ Real time feedback of news
β€’ Real time comments about tv shows or events
β€’ Share contents with million people
β€’ Clustering users according to age, interests, page
  likes..
β€’ …
                                                       12
Outline

β€’ Social Networks and Web 2.0
οƒΌ Sentiment analysis: what is it?
β€’ Sentiment analysis: applications
β€’ Sentiment analysis: an inside look
β€’ TwitterSA
β€’ TwitterSA Soccer Match Analysis
β€’ TwitterSA Predicting Elections



                                       13
What is Sentiment Analysis?
 Theory
Β«Sentiment Analysis or Opinion Mining
is the computational study of opinions,
sentiments and emotions expressed in
text.
        -- Bing Liu, 2010,
Β«Sentiment Analysis and SubjectivityΒ»


Practical
Using NLP, statistics, or Machine Learning methods to
extract and identify, how positive – negative is the sentiment
content expressed in a review, blog, discussion, news,
                                                                 14
comment or in any other document.
Why are online opinions so important?

β€’ Β«OpinionsΒ» are the influencers of our behaviours.

β€’ Before Β«making a decisionΒ», we usually seek out the
  opinion of the others:
   – Buy a product, rent a car, reserve an hotel room,
     looking for a good restaurant…

 Individuals: seek opinions from friends and family
 Organizations: use surveys, opinion polls, consultants




                                                           15
Why are online opinions so important?




                                        16
Sentiment Analysis Application Areas

– Organization/brand
   β€’ Know the org/brand reputation in Web
   β€’ Know consumers opinion about a product
   β€’ Understand consumers needs
   β€’ …
– Individuals
   β€’ Make decision before buy something
   β€’ Know aggregate sentiment for a product review
   β€’ Find public opinion about person, politician, …
– Research Studies
   β€’ Predict political results
   β€’ Predict box office
   β€’ Citizen polls
                                                       17
Sentiment Analysis Application Areas

– Marketing 2.0
   β€’ Advertisement Placements:
      – Place ads if one praises a product
      – Place ads from competitor if one dislikes a product
   β€’ Join Sentiment and recommendation systems
      – Know what kind of people praise a product (age,
        interest..)
   β€’ How people are responding to a product release/ad
     campaign
– Social tv
   β€’ Know what people think and audience about a tv
     show
   β€’ Do interactive polls during tv journal

                                                              18
Sentiment Analysis Application Areas




                                       19
Sentiment Analysis Application Areas

Some tools can also measure the overall sentiment expressed in blogs and social networks

Example: An earthquake produced a lot of negative sentiments




                                                                                   20
Sentiment Analysis Application Areas
       Sentiment integration with search engines




                                                   21
Sentiment Analysis Application Areas




                                       22
Also a fascinating problem
 Intellectually challenging & many applications
   A popular research topic in recent years (Shanahan, Qu, and Wiebe,
      2006 (edited book); Surveys - Pang and Lee 2008; Liu, 2006 and 2011;
      2010)
   More than 100 companies in USA alone
   Many workshop and conference
        http://sentimentsymposium.com/
        www.gplsi.dlsi.ua.es/congresos/wassa2012/
 A Large Research Area
     Opinion mining, Text Mining
     Sentiment and Subjectivity analysis
     Artificial Intelligence
     Natural Language Processing
     Computational Linguistic
     Etc.

                                                                             23
Sentiment Analysis and Social Web
            How to do that?
Easy: search the Web and find a Sentiment Analysis tools

β€’   http://www.twitalyzer.com/index.asp
β€’   http://twendz.waggeneredstrom.com/
β€’   http://www.sentiment140.com/
β€’   http://www.blogmeter.it
β€’   http://twitrratr.com/
β€’   http://www.socialmention.com
β€’   http://www.lovewillconquer.co.uk/
β€’   Hundred more..

And professional sites for companies
β€’ www.radian6.com                                          24
β€’ www.sysomos.com
Sentiment Analysis online tools

But you will find that Rita Levi Montalcini wasn’t
very popular




                                                     25
Sentiment Analysis online tools

Or was she?




                                     26
So again, how to do that?




                            27
Outline

β€’ Social Networks and Web 2.0
β€’ Sentiment analysis: what is it?
β€’ Sentiment analysis: applications
οƒΌ Sentiment Analysis: an inside look
β€’ TwitterSA
β€’ TwitterSA Soccer Match Analysis
β€’ TwitterSA Predicting Elections



                                       28
What is an opinion (1)?

β€œI bought an iPhone a few days ago. It is such a nice phone. The
touch screen is really cool. The voice quality is clear too. It is
much better than my old Blackberry, which was a terrible
phone and so difficult to type with its tiny keys. However, my
mother was mad with me as I did not tell her before I bought
the phone. She also thought the phone was too expensive, …”


Looking at this review is possible to do:

β€’ Document-level sentiment analysis: is this review + or -?
β€’ Sentence-level sentiment analysis: is each sentence + or -?
β€’ Entity-level sentiment analysis: is iPhone + or -?          29
What is an opinion (2)?
 An opinion is a quintuple

       (𝒆 𝒋 , 𝒂 π’‹π’Œ , 𝒔𝒐 π’Šπ’‹π’Œπ’ , 𝒉 π’Š , 𝒕 𝒍 )

where
 𝑒 𝑗 is the target entity (person, product, organization,
  event or a generic topic)
 π‘Ž π‘—π‘˜ is an aspect/feature of the entity
 π‘ π‘œ π‘–π‘—π‘˜π‘™ is the sentiment value of the opinion polarity :
  usually positive, negative or neutral
 β„Ž 𝑖 is the opinion holder
 𝑑 𝑙 is the time when opinion is expressed
                                                             30
What is an opinion (3)?

        Entity – Feature – Polarity – Opinion Holder – Time


β€’     I bought an iPhone and the touch screen is really cool.
      (Positive)
β€’     My old Blackberry, which was a terrible phone and so
      difficult to type with its tiny keys (Negative)



     In quintuples
       (iPhone, touch screen, positive, Author, review data)
       (Blackberry, keys, negative, Author, review data)
                                                                31
Sentiment Analysis is hard (1)!
                            Manage Negations
β€’ Direct Negation: β€˜I don't like my new Iphone’
β€’ Ambiguous Negation: β€˜Not only is this phone expensive but it's also
  heavy and difficult to use’
β€’ Indirect Negation: β€˜Perhaps it is a great phone, but I fail to see why’
                         Co-reference Resolution
β€’ β€˜We watched the movie and went to dinner; it was awful’
       What does β€˜it’ refers to??

                         Slang and Writing Errors
β€’   Shortform: nite (night), sayin (saying).
β€’   Acronyms: lol (laugh out loud), iirc (if I remember correctly).
β€’   Writing Errors: wouls(would), rediculous (ridiculous).
β€’   Punctuation Errors: im (I'm), dont (don't).
β€’   Slang: that was well mint (that was very good).
β€’   Repeated Letters: that was soooooo greeeat (that was so great).
β€’   Alphanumeric Words: 2night(tonight), str8(straight).                    32
Sentiment Analysis is hard (2)!
          Entity Disambiguation




                  ?
                                  33
Sentiment Analysis is hard (3)!
                           Manage Comparative
β€’ β€˜Federer is better than Nadal’
       Federer (+)
       Nadal (-)

                         Domain Dependent Opinion
β€’ β€˜The battery life is long’ (+)
β€’ β€˜The waiting time to enter at restaurant was too long’ (-)

                             More Challenges
β€’   Opinion Spam
β€’   Sarcasm
β€’   More general complexity of natural language
β€’   …


                                                               34
Sentiment Analysis is hard (4)!
 A company posted an ad for writing fake reviews on amazon.com
(65 cents per review)




                                                                  35
Sentiment Analysis: Known Approaches
          Building opinion words lexicon
β€’ Lexical Methods
  – Manual approach
  – Dictionary-based approach (Hu and Liu, 2004, Andreevskaia and
     Bergler, 2006, Dragut et al 2010)
  – Corpus-based approach (Hazivassiloglou and McKeown, 1997; Turney, 2002;
     Yu and Hazivassiloglou, 2003; Kanayama and Nasukawa, 2006; Ding, Liu and Yu, 2008)



β€’ Machine Learning
  – Unsupervised learning (Hatzivassiloglou and McKeown 1997, Yu and
     Hatzivassiloglou 2003)
  – Supervised learning (Alec Go et al 2009, Pang – Lee 2002, 2010 Pak – Paroubek)
  – Semi-supervised learning (Andreevskaia and Bergler, 2006 , Esuti and
     Sebastiani, 2005 )

                                                                                          36
Sentiment Analysis: Known Approaches
         Building opinion words lexicon
 β€’ Manual approach
       β€’ Pro: precision, no rules to define
       β€’ Cons: no automation, time for set up lexicon
 β€’ Dictionary-based approach
       β€’ Manual or prepared dictionary of positive – negative
         words. Expand dictionary with synonyms and antonyms.
       β€’ Pro: faster, semi-automated
       β€’ Cons: low precision (synonyms: great -> excellent and
         admirable but also -> large; big; fat)
 β€’ Corpus-based approach
       β€’ Seed set of positive – negative adjective (for example)
       β€’ Expand this set using grammar bindings
       β€’ Example: β€˜this car is beautiful and spacious’ ; if is known
         that beautiful is positive also spacious is positive.
       β€’ Pro: high automation, moderate precision                    37
       β€’ Cons: attention to grammar rules, word set isn’t complete
Outline

β€’ Social Networks and Web 2.0
β€’ Sentiment analysis: what is it?
β€’ Sentiment analysis: applications
β€’ S.A. an inside look
οƒΌ TwitterSA
β€’ TwitterSA Soccer Match Analysis
β€’ TwitterSA Predicting Elections



                                     38
Twitter: The Social Network
β€’   140 char max status length
β€’   Can add urls with multimedia
β€’   99% are public status
β€’   No friend: followers and following
β€’   Hashtag #




                                          39
TwitterSA




            40
TwitterSA: Machine Learning
Goal: Classify text input in Positive or Negative

              Supervised Algorithm
β€’ Must provide a set of inputs (Text phrase) and the
  appropriate outputs class (Positive or Negative) for
  those inputs.
β€’ Learning algorithm will train using those inputs.
  After that is able to classify a new instance.




                                                         41
TwitterSA: Multinomial Naive Bayes
         Naive Bayes Theorem

                  X = new text instance to classify
                   π‘ͺ 𝟏 . . π‘ͺ 𝒏 = possible class (Ex. Positive, Negative..)
                   𝑷(𝑿|π‘ͺ π’Š ) = products of probabilities that single attributes
                  of istance X appertein to class 𝐢 𝑖
                   𝑷(π‘ͺ π’Š |𝑿) = probability that new instance X appartein to
                  class 𝐢 𝑖




     X


         X                       P(C|X)
                                                                           42
Multinomial Naive Bayes: A Worked Example
                            Doc   Words vector   Class (𝐢 𝑖 )
                 Training   1     Love           C1 = Pos
                            2     Almost hate    C2 = Neg
                            3     Love           C1 = Pos
                 Test       4     Almost Love    ?




                                                        43
Multinomial Naive Bayes: A Worked Example
                                            Doc   Words vector   Class (𝐢 𝑖 )
                                 Training   1     Love           C1 = Pos
       𝑁𝑐          𝑉 = 3 π‘€π‘œπ‘Ÿπ‘‘π‘               2     Almost hate    C2 = Neg
𝑃 𝐢𝑖 =
       𝑁                                    3     Love           C1 = Pos
             π‘π‘œπ‘’π‘›π‘‘ 𝑋, 𝐢 𝑖 + 1    Test       4     Almost Love    ?
𝑃 𝑋 | 𝐢𝑖 =
             π‘π‘œπ‘’π‘›π‘‘ 𝐢 𝑖 + |𝑉|

               2                  1
𝑷 π‘ͺ𝟏 = 𝒑𝒐𝒔 =         𝑷 π‘ͺ𝟐 = π’π’†π’ˆ =
               3                  3




                                                                        44
Multinomial Naive Bayes: A Worked Example
                                                    Doc   Words vector   Class (𝐢 𝑖 )
                                         Training   1     Love           C1 = Pos
        𝑁𝑐         𝑉 = 3 π‘€π‘œπ‘Ÿπ‘‘π‘                       2     Almost hate    C2 = Neg
 𝑃 𝐢𝑖 =
        𝑁                                           3     Love           C1 = Pos
               π‘π‘œπ‘’π‘›π‘‘ 𝑋, 𝐢 𝑖 + 1          Test       4     Almost Love    ?
 𝑃 𝑋 | 𝐢𝑖 =
               π‘π‘œπ‘’π‘›π‘‘ 𝐢 𝑖 + |𝑉|

              2                1
𝑷 π‘ͺ𝟏 = 𝒑𝒐𝒔 =      𝑷 π‘ͺ𝟐 = π’π’†π’ˆ =
              3                3
Conditional Probabilities
                     2+1
  𝑃 πΏπ‘œπ‘£π‘’     π‘ƒπ‘œπ‘  ) = (2+3) = 3/5

                          0+1
  𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘     π‘ƒπ‘œπ‘  ) =           = 1/5
                         (2+3)

                      0+1
  𝑃 πΏπ‘œπ‘£π‘’     𝑁𝑒𝑔 ) = (2+3) = 1/5

                         1+1                                                    45
  𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘ 𝑁𝑒𝑔 ) = (2+3) = 2/5
Multinomial Naive Bayes: A Worked Example
                                                        Doc      Words vector         Class (𝐢 𝑖 )
                                         Training       1        Love                 C1 = Pos
        𝑁𝑐         𝑉 = 3 π‘€π‘œπ‘Ÿπ‘‘π‘                           2        Almost hate          C2 = Neg
 𝑃 𝐢𝑖 =
        𝑁                                               3        Love                 C1 = Pos
               π‘π‘œπ‘’π‘›π‘‘ 𝑋, 𝐢 𝑖 + 1          Test           4        Almost Love          ?
 𝑃 𝑋 | 𝐢𝑖 =
               π‘π‘œπ‘’π‘›π‘‘ 𝐢 𝑖 + |𝑉|

              2                1
𝑷 π‘ͺ𝟏 = 𝒑𝒐𝒔 =      𝑷 π‘ͺ𝟐 = π’π’†π’ˆ =
              3                3
Conditional Probabilities
                     2+1
  𝑃 πΏπ‘œπ‘£π‘’     π‘ƒπ‘œπ‘  ) = (2+3) = 3/5                    Choosing a class

                          0+1
  𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘     π‘ƒπ‘œπ‘  ) =           = 1/5
                         (2+3)                  𝑃 π‘π‘œπ‘        π‘‘π‘œπ‘4) = 2/3 * 1/5 * 3/5 = 0,08
                      0+1
  𝑃 πΏπ‘œπ‘£π‘’     𝑁𝑒𝑔 ) = (2+3) = 1/5
                                                𝑃 𝑛𝑒𝑔       π‘‘π‘œπ‘4) = 1/3 * 1/5 * 2/5 = 0,026
                         1+1                                                                  46
  𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘ 𝑁𝑒𝑔 ) = (2+3) = 2/5
Multinomial Naive Bayes: A Worked Example
    Input corpus: attributes and classes




                                           Training weights




                                                              47
TwitterSA: Collecting Corpus
β€’ Big corpus for training with label annotation!!
β€’ Different methods from corpus-based or dictionary-based
  approach
β€’ Collecting big sentiment corpus starting from noise label
   – Bag of words for training bayesian learning algorithm
   – Found that iPhone and :) can contain positive sentiment and :( the
     contrary
   – Discovered that also hashtags can be used as noise label




                                 o β€˜Recently I've started developing a love for
                                   indie music ... #loveit’
                                 o β€˜I have to say, I am so impressed with this
                                   iPhone5. I will never ever go back to a Droid.
                                   #loveit #happy’
                                                                              48
TwitterSA: Text Processing and
              Normalization
TwitterSA process: many modules

Normalization of repeated letters and
alphanumeric

   Discard terms with high Entropy and low
   Salience

       Manage negation for sentiment training


           Convert slang words to normal form


               Unigram, Bigram for training
                                                49
TwitterSA: Vector of Feature (1)
LIWC Dictionary
                                        MPQA Dictionary




         Linguistic categories
                                   Input: β€˜Happy Birthday Steve
                                   Jobs your iPhone is amazing’




                                 {Pos, Neg}       LIWC Categories

                                  {1,0}        {posEmo, affect, ..}
                                                                      50
TwitterSA: Vector of Feature (2)
                                           POS Tag         Description               Example
                                           CC        conjunction              and, but, or, &
                 POS Tagger                CD        cardinal number          1, three
  Input: He is the best                    DT        determiner               the
  Output: He|PRP is|VBP the|DT best|JJS    JJ        adjective                green
                                           JJR       adjective, comparative   greener
                                           JJS       adjective, superlative   greenest
                                           …         …                        …
   1
0,8
0,6               Negative Sentence                          Positive Sentence
        Personal pronouns and possessive             Adjective and superlative adverb.
0,4     Comparative adjective                        Proper Noun
        Verbs in past tense
0,2
  0

        NNS
             JJ




        NNP
           JJR




         MD




          CD
          WP


         POS
          FW




         PRP
          TO



          CC
       -LRB-




           RP




          NN



        PDT
         RBR




           JJS
         RBS
       -RRB-
        PRP$




          DT
           RB




            IN




        WDT
          UH
        WRB




       NNPS
         VBZ



         VBP



           VB

        VBG
        VBN
        VBD




-0,2
-0,4
-0,6
-0,8
             Tag occurrence in positive and negative sentence                                   51
  -1
TwitterSA: Vector of Feature (3)
             Pattern Mask
β€˜The combination of one or more near tag’


β€’ Input
   β€’ He|PRP is|VBP the|DT best|JJS

β€’ Example output
   β€’ PRP|VBP ; PRP|VBP|DT ; PRP|VBP|DT|JJS ; etc.

β€’ Discover most frequency pattern mask in positive and negative
  sentence.


                                             {Pos, Neg}
  For example an input PRP|VBP|DT|JJS
  occurs almost in positive sentence           {1,0}
                                                                  52
TwitterSA: How Much is Accurate
β€’ N-Fold Cross Validation (average results)




β€’ Split corpus:
      70% Training; 30% Test

β€’ Manual corpus for testing

                                              53
TwitterSA: Testing
                  Classification problem (not Retrieval)
Precision: is positive predictive value, or correctly classified instance
Recall: or Sensitivity is the proportion of actual positives which are
correctly identified as such
                                                           Confusion Matrix

                                                            Classified Classified
                                                            Positive   Negative
                                              Predicted TP                  FN
                                              Positive
                                              Predicted FP                  TN
                                              Negative




                                                                                 54
Outline

β€’ Social Networks and Web 2.0
β€’ Sentiment analysis: what is it?
β€’ Sentiment analysis: applications
β€’ S.A. an inside look
β€’ TwitterSA
οƒΌ TwitterSA Soccer Match Analysis
β€’ TwitterSA Predicting Elections



                                     55
TwitterSA: Soccer Match Analysis

               What
Monitor soccer match on Twitter
Milan – Inter, Seria A Season 2011-2012


                                 Goal
β€’ Understand with automatic sentiment analysis the behaviour of the match
β€’ Who wins? How many goals?




                                                                            56
TwitterSA: Soccer Match Analysis
         Volume of tweets Inter and Milan
 1600
 1400                        Match Start

 1200
 1000
 800                                         Tweets
 600
 400
  200
   0
        Mon          Fri       Sun     Mon



                                                      57
TwitterSA: Soccer Match Analysis
         Volume of tweets Inter and Milan
 1600
 1400                             Match Start
              Coaches Interview
 1200
 1000
 800                                              Tweets
 600
 400
  200
   0
        Mon             Fri         Sun     Mon



                                                           58
TwitterSA: Soccer Match Analysis
         Volume of tweets Inter and Milan
 1600
 1400                             Match Start
                                                  Goal: which team?
              Coaches Interview
 1200
 1000
 800                                                 Tweets
 600
 400
  200
   0
        Mon             Fri         Sun     Mon



                                                                  59
TwitterSA: Soccer Match Analysis
                                       Sentiment Analysis
                    90
                                                   Goal: which team?
                    85


                    80
% Positive Tweets




                    75


                    70
                                                                                       Milan pos
                                                                                       Inter pos
                    65


                    60


                    55


                    50
                         match day   before tweet peak     match end   the day after
                                                   timeline
                                                                                                   60
TwitterSA: Soccer Match Analysis
β€’ Who won?
β€’ How many goals?




                                       61
TwitterSA: Soccer Match Analysis
β€’ Who won?
β€’ How many goals?




                                       62
Outline

β€’ Social Networks and Web 2.0
β€’ Sentiment analysis: what is it?
β€’ Sentiment analysis: applications
β€’ S.A. an inside look
β€’ TwitterSA
β€’ TwitterSA Soccer Match Analysis
οƒΌ TwitterSA Predicting Elections



                                     63
TwitterSA: Predicting Elections




                                  64
TwitterSA: Predicting Elections




                                  65
TwitterSA: Predicting Elections




                                  66
TwitterSA: Predicting Elections




                                  67
TwitterSA: Predicting Elections




                                  68
TwitterSA: Predicting Elections




                                  69
TwitterSA: Predicting Elections

                         Results of elections




Full article and infographics at
http://davidefeltoni.wordpress.com
                                                70
References
β€’   Dr. Diana Maynard: Practical Sentiment Analysis
β€’   Seth Grimes: Sentiment Analysis Symposium 2012
β€’   B. Liu reference is available here:
    http://www.cs.uic.edu/~liub/FBS/AAAI-2011-tutorial-references.pdf
β€’   Best Survey about Sentiment Analysis: B. Liu β€˜Sentiment Analysis and
    Subjectivity’ chapter is available here:
    http://www.cs.uic.edu/~liub/FBS/NLP-handbook-sentiment-
    analysis.pdf
β€’   Great tutorial for Sentiment Analysis:
    http://sentiment.christopherpotts.net/
β€’   Some images and statistics are taken from www.basistech.com,
    www.nielsen.com




                                                                       71
β€˜Quando in codesto sentire ti senti veramente felice,
chiamalo pure come vuoi: chiamalo felicitΓ , cuore,
amore. Per questo io non ho nome alcuno.
Sentimento Γ¨ tutto! La parola Γ¨ soltanto suono e fumo.’’

Johann Wolfgang von Goethe




                                                           72

More Related Content

What's hot

Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment AnalysisJaganadh Gopinadhan
Β 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesKarol Chlasta
Β 
Sentiment Analysis using Twitter Data
Sentiment Analysis using Twitter DataSentiment Analysis using Twitter Data
Sentiment Analysis using Twitter DataHari Prasad
Β 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using mlPravin Katiyar
Β 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisAditya Nag
Β 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonHetu Bhavsar
Β 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on TwitterSubarno Pal
Β 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysisSeher Can
Β 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment AnalysisRebecca Williams
Β 
Opinion Mining
Opinion MiningOpinion Mining
Opinion MiningAli Habeeb
Β 
Sentiment Analysis Using Product Review
Sentiment Analysis Using Product ReviewSentiment Analysis Using Product Review
Sentiment Analysis Using Product ReviewAbdullah Moin
Β 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis worksCJ Jenkins
Β 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter DataNurendra Choudhary
Β 
Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14Rachit Goel
Β 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSangeeth Nagarajan
Β 
Twitter sentiment analysis
Twitter sentiment analysisTwitter sentiment analysis
Twitter sentiment analysisSunil Kandari
Β 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysisAkhila
Β 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on TwitterNitish J Prabhu
Β 
Sentimental analysis
Sentimental analysisSentimental analysis
Sentimental analysisAnkit Khera
Β 

What's hot (20)

Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
Β 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use cases
Β 
Sentiment Analysis using Twitter Data
Sentiment Analysis using Twitter DataSentiment Analysis using Twitter Data
Sentiment Analysis using Twitter Data
Β 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using ml
Β 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
Β 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using python
Β 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on Twitter
Β 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
Β 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment Analysis
Β 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
Β 
Sentiment Analysis Using Product Review
Sentiment Analysis Using Product ReviewSentiment Analysis Using Product Review
Sentiment Analysis Using Product Review
Β 
Amazon seniment
Amazon senimentAmazon seniment
Amazon seniment
Β 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis works
Β 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter Data
Β 
Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14
Β 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Β 
Twitter sentiment analysis
Twitter sentiment analysisTwitter sentiment analysis
Twitter sentiment analysis
Β 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysis
Β 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on Twitter
Β 
Sentimental analysis
Sentimental analysisSentimental analysis
Sentimental analysis
Β 

Similar to Sentiment Analysis and Social Media: How and Why

Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...Hartford Foundation for Public Giving
Β 
Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsAmit Sheth
Β 
Jour 504 pr research 11 8-10
Jour 504 pr research 11 8-10Jour 504 pr research 11 8-10
Jour 504 pr research 11 8-10Dana Chinn
Β 
Social Media Analytics: The Value Proposition
Social Media Analytics: The Value PropositionSocial Media Analytics: The Value Proposition
Social Media Analytics: The Value PropositionContent Savvy
Β 
Lecture1 2013
Lecture1 2013Lecture1 2013
Lecture1 2013Matt holland
Β 
Nycon social media nyfa presentation
Nycon social media nyfa presentationNycon social media nyfa presentation
Nycon social media nyfa presentationAndrew Marietta
Β 
Understanding and capitalizing on the social media revolution
Understanding and capitalizing on the social media revolutionUnderstanding and capitalizing on the social media revolution
Understanding and capitalizing on the social media revolutionMamoun Matar
Β 
5 Timesaving Tools for Managing the Overwhelming World of Social Media
5 Timesaving Tools for Managing the Overwhelming World of Social Media5 Timesaving Tools for Managing the Overwhelming World of Social Media
5 Timesaving Tools for Managing the Overwhelming World of Social MediaOff Madison Ave
Β 
14th strategic planning skills bringing social media into the strategic plann...
14th strategic planning skills bringing social media into the strategic plann...14th strategic planning skills bringing social media into the strategic plann...
14th strategic planning skills bringing social media into the strategic plann...Michael Rabinovici
Β 
TAF&E - Using Social and Mobile To Increase ROI
TAF&E - Using Social and Mobile To Increase ROITAF&E - Using Social and Mobile To Increase ROI
TAF&E - Using Social and Mobile To Increase ROISarah Page
Β 
Social media as communication tool draft
Social media as communication tool  draftSocial media as communication tool  draft
Social media as communication tool draftMamoun Matar
Β 
ALIGN 2011 social media session
ALIGN 2011 social media sessionALIGN 2011 social media session
ALIGN 2011 social media sessionDan Cohen
Β 
analyzing public sentiments using twitter feeds
 analyzing public sentiments using twitter feeds analyzing public sentiments using twitter feeds
analyzing public sentiments using twitter feedsOrakzay
Β 
Lecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationLecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationMarieke van Erp
Β 
National Geographic - Omniture Cafe 6/11/09
National Geographic - Omniture Cafe 6/11/09National Geographic - Omniture Cafe 6/11/09
National Geographic - Omniture Cafe 6/11/09Ted McDonald
Β 
Social smarts presenation marshall sponder - updated 9-16-12
Social smarts presenation   marshall sponder - updated 9-16-12Social smarts presenation   marshall sponder - updated 9-16-12
Social smarts presenation marshall sponder - updated 9-16-12Marshall Sponder
Β 
Lasa cyp social media
Lasa cyp social mediaLasa cyp social media
Lasa cyp social mediaMiles Maier
Β 

Similar to Sentiment Analysis and Social Media: How and Why (20)

Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Β 
Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and Applications
Β 
Jour 504 pr research 11 8-10
Jour 504 pr research 11 8-10Jour 504 pr research 11 8-10
Jour 504 pr research 11 8-10
Β 
Social Media Analytics: The Value Proposition
Social Media Analytics: The Value PropositionSocial Media Analytics: The Value Proposition
Social Media Analytics: The Value Proposition
Β 
Bsm wk iii_f12
Bsm wk iii_f12Bsm wk iii_f12
Bsm wk iii_f12
Β 
Lecture1 2013
Lecture1 2013Lecture1 2013
Lecture1 2013
Β 
Nycon social media nyfa presentation
Nycon social media nyfa presentationNycon social media nyfa presentation
Nycon social media nyfa presentation
Β 
Understanding and capitalizing on the social media revolution
Understanding and capitalizing on the social media revolutionUnderstanding and capitalizing on the social media revolution
Understanding and capitalizing on the social media revolution
Β 
Social Media Dataset
Social Media DatasetSocial Media Dataset
Social Media Dataset
Β 
5 Timesaving Tools for Managing the Overwhelming World of Social Media
5 Timesaving Tools for Managing the Overwhelming World of Social Media5 Timesaving Tools for Managing the Overwhelming World of Social Media
5 Timesaving Tools for Managing the Overwhelming World of Social Media
Β 
14th strategic planning skills bringing social media into the strategic plann...
14th strategic planning skills bringing social media into the strategic plann...14th strategic planning skills bringing social media into the strategic plann...
14th strategic planning skills bringing social media into the strategic plann...
Β 
TAF&E - Using Social and Mobile To Increase ROI
TAF&E - Using Social and Mobile To Increase ROITAF&E - Using Social and Mobile To Increase ROI
TAF&E - Using Social and Mobile To Increase ROI
Β 
Social media as communication tool draft
Social media as communication tool  draftSocial media as communication tool  draft
Social media as communication tool draft
Β 
ALIGN 2011 social media session
ALIGN 2011 social media sessionALIGN 2011 social media session
ALIGN 2011 social media session
Β 
analyzing public sentiments using twitter feeds
 analyzing public sentiments using twitter feeds analyzing public sentiments using twitter feeds
analyzing public sentiments using twitter feeds
Β 
Lecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationLecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and Visualisation
Β 
National Geographic - Omniture Cafe 6/11/09
National Geographic - Omniture Cafe 6/11/09National Geographic - Omniture Cafe 6/11/09
National Geographic - Omniture Cafe 6/11/09
Β 
Social smarts presenation marshall sponder - updated 9-16-12
Social smarts presenation   marshall sponder - updated 9-16-12Social smarts presenation   marshall sponder - updated 9-16-12
Social smarts presenation marshall sponder - updated 9-16-12
Β 
Social media
Social mediaSocial media
Social media
Β 
Lasa cyp social media
Lasa cyp social mediaLasa cyp social media
Lasa cyp social media
Β 

Recently uploaded

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
Β 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
Β 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
Β 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
Β 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
Β 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
Β 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
Β 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
Β 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
Β 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
Β 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
Β 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel AraΓΊjo
Β 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
Β 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
Β 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
Β 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
Β 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
Β 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
Β 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
Β 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
Β 

Recently uploaded (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
Β 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Β 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
Β 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Β 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
Β 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Β 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
Β 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Β 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
Β 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Β 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Β 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Β 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Β 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
Β 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
Β 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Β 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Β 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
Β 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
Β 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
Β 

Sentiment Analysis and Social Media: How and Why

  • 1. Sentiment Analysis in Social Media How and Why Davide Feltoni Gurini 1 Sistemi Intelligenti per Internet A.A. 2012/2013
  • 2. About me and Contacts 1Β° Year Ph.D. in Computer Engineering at Roma Tre University Via della Vasca Navale 79, Rome A.I. Lab. Room 2.02a Contacts: β€’ feltoni@dia.uniroma3.it β€’ http://about.me/davidefeltoni 2
  • 3. Outline οƒΌ Social Networks and Web 2.0 β€’ Sentiment analysis: what is it? β€’ Sentiment analysis: applications β€’ Sentiment analysis: an inside look β€’ TwitterSA β€’ TwitterSA Soccer Match Analysis β€’ TwitterSA Predicting Elections 3
  • 4. Social Networks and Web 2.0 Common Uses: β€’ Spend time on internet! β€’ Virtual network of friends β€’ Gather info about something β€’ Share people what we do β€’ Know what other people do β€’ Know what is happening in the world β€’ … 4
  • 5. Social Networks and Web 2.0 Behind the scenes β€’ Data Analyst β€’ Statistical studies β€’ How to collect and store big data β€’ Business Analyst β€’ Social marketing β€’ Web Advertising β€’ Web and App Developer β€’ Integrate social in games and apps β€’ Ph.D. and Researcher β€’ Research methods and publish articles 5
  • 6. Social Networks and Web 2.0 Evolution of Web β€’ 1.0 (web for experts) 2.0 (web for everyone) β€’ Users are not only reading β€’ Web is a data container β€’ Users now generate more content β€’ Blog, Social networks, write reviews, … β€’ Collaborative sites (Wikipedia, Communities, Forum) 6
  • 7. Social Networks: Overview of Italy 7 Source: nielsen 2012
  • 8. Social Networks: Some Statistics Facebook monthly active users now total nearly 850 million People spent about 11 hour at month considering only Facebook There are 175 million tweets sent from Twitter every day in 2012 The 2012 election broke records with 31.7 million political tweets. 625,000 new users on Google+ every day 27% of small and 34% of medium businesses are using social media for business (+20% YoY) 8 Source: www.huffingtonpost.com
  • 9. Social Networks and Web 2.0: Big Audience, Big Data β€’ Even older people use social networks β€’ Important for web data analysis 9
  • 10. Social Networks: Real Time Update 2012 Greece clashes β€’ Fresh news: search engine vs social network 10
  • 11. Social Networks: Simultaneous Use 11 Source: nielsen 2012
  • 12. Social Networks and Web 2.0: Why Are So Important? β€’ Know what most of people think in Web β€’ Big data available almost free β€’ Great audience and big slice of the real population β€’ Real time feedback of news β€’ Real time comments about tv shows or events β€’ Share contents with million people β€’ Clustering users according to age, interests, page likes.. β€’ … 12
  • 13. Outline β€’ Social Networks and Web 2.0 οƒΌ Sentiment analysis: what is it? β€’ Sentiment analysis: applications β€’ Sentiment analysis: an inside look β€’ TwitterSA β€’ TwitterSA Soccer Match Analysis β€’ TwitterSA Predicting Elections 13
  • 14. What is Sentiment Analysis? Theory Β«Sentiment Analysis or Opinion Mining is the computational study of opinions, sentiments and emotions expressed in text. -- Bing Liu, 2010, Β«Sentiment Analysis and SubjectivityΒ» Practical Using NLP, statistics, or Machine Learning methods to extract and identify, how positive – negative is the sentiment content expressed in a review, blog, discussion, news, 14 comment or in any other document.
  • 15. Why are online opinions so important? β€’ Β«OpinionsΒ» are the influencers of our behaviours. β€’ Before Β«making a decisionΒ», we usually seek out the opinion of the others: – Buy a product, rent a car, reserve an hotel room, looking for a good restaurant…  Individuals: seek opinions from friends and family  Organizations: use surveys, opinion polls, consultants 15
  • 16. Why are online opinions so important? 16
  • 17. Sentiment Analysis Application Areas – Organization/brand β€’ Know the org/brand reputation in Web β€’ Know consumers opinion about a product β€’ Understand consumers needs β€’ … – Individuals β€’ Make decision before buy something β€’ Know aggregate sentiment for a product review β€’ Find public opinion about person, politician, … – Research Studies β€’ Predict political results β€’ Predict box office β€’ Citizen polls 17
  • 18. Sentiment Analysis Application Areas – Marketing 2.0 β€’ Advertisement Placements: – Place ads if one praises a product – Place ads from competitor if one dislikes a product β€’ Join Sentiment and recommendation systems – Know what kind of people praise a product (age, interest..) β€’ How people are responding to a product release/ad campaign – Social tv β€’ Know what people think and audience about a tv show β€’ Do interactive polls during tv journal 18
  • 20. Sentiment Analysis Application Areas Some tools can also measure the overall sentiment expressed in blogs and social networks Example: An earthquake produced a lot of negative sentiments 20
  • 21. Sentiment Analysis Application Areas Sentiment integration with search engines 21
  • 23. Also a fascinating problem  Intellectually challenging & many applications  A popular research topic in recent years (Shanahan, Qu, and Wiebe, 2006 (edited book); Surveys - Pang and Lee 2008; Liu, 2006 and 2011; 2010)  More than 100 companies in USA alone  Many workshop and conference  http://sentimentsymposium.com/  www.gplsi.dlsi.ua.es/congresos/wassa2012/  A Large Research Area  Opinion mining, Text Mining  Sentiment and Subjectivity analysis  Artificial Intelligence  Natural Language Processing  Computational Linguistic  Etc. 23
  • 24. Sentiment Analysis and Social Web How to do that? Easy: search the Web and find a Sentiment Analysis tools β€’ http://www.twitalyzer.com/index.asp β€’ http://twendz.waggeneredstrom.com/ β€’ http://www.sentiment140.com/ β€’ http://www.blogmeter.it β€’ http://twitrratr.com/ β€’ http://www.socialmention.com β€’ http://www.lovewillconquer.co.uk/ β€’ Hundred more.. And professional sites for companies β€’ www.radian6.com 24 β€’ www.sysomos.com
  • 25. Sentiment Analysis online tools But you will find that Rita Levi Montalcini wasn’t very popular 25
  • 26. Sentiment Analysis online tools Or was she? 26
  • 27. So again, how to do that? 27
  • 28. Outline β€’ Social Networks and Web 2.0 β€’ Sentiment analysis: what is it? β€’ Sentiment analysis: applications οƒΌ Sentiment Analysis: an inside look β€’ TwitterSA β€’ TwitterSA Soccer Match Analysis β€’ TwitterSA Predicting Elections 28
  • 29. What is an opinion (1)? β€œI bought an iPhone a few days ago. It is such a nice phone. The touch screen is really cool. The voice quality is clear too. It is much better than my old Blackberry, which was a terrible phone and so difficult to type with its tiny keys. However, my mother was mad with me as I did not tell her before I bought the phone. She also thought the phone was too expensive, …” Looking at this review is possible to do: β€’ Document-level sentiment analysis: is this review + or -? β€’ Sentence-level sentiment analysis: is each sentence + or -? β€’ Entity-level sentiment analysis: is iPhone + or -? 29
  • 30. What is an opinion (2)?  An opinion is a quintuple (𝒆 𝒋 , 𝒂 π’‹π’Œ , 𝒔𝒐 π’Šπ’‹π’Œπ’ , 𝒉 π’Š , 𝒕 𝒍 ) where  𝑒 𝑗 is the target entity (person, product, organization, event or a generic topic)  π‘Ž π‘—π‘˜ is an aspect/feature of the entity  π‘ π‘œ π‘–π‘—π‘˜π‘™ is the sentiment value of the opinion polarity : usually positive, negative or neutral  β„Ž 𝑖 is the opinion holder  𝑑 𝑙 is the time when opinion is expressed 30
  • 31. What is an opinion (3)? Entity – Feature – Polarity – Opinion Holder – Time β€’ I bought an iPhone and the touch screen is really cool. (Positive) β€’ My old Blackberry, which was a terrible phone and so difficult to type with its tiny keys (Negative)  In quintuples (iPhone, touch screen, positive, Author, review data) (Blackberry, keys, negative, Author, review data) 31
  • 32. Sentiment Analysis is hard (1)! Manage Negations β€’ Direct Negation: β€˜I don't like my new Iphone’ β€’ Ambiguous Negation: β€˜Not only is this phone expensive but it's also heavy and difficult to use’ β€’ Indirect Negation: β€˜Perhaps it is a great phone, but I fail to see why’ Co-reference Resolution β€’ β€˜We watched the movie and went to dinner; it was awful’ What does β€˜it’ refers to?? Slang and Writing Errors β€’ Shortform: nite (night), sayin (saying). β€’ Acronyms: lol (laugh out loud), iirc (if I remember correctly). β€’ Writing Errors: wouls(would), rediculous (ridiculous). β€’ Punctuation Errors: im (I'm), dont (don't). β€’ Slang: that was well mint (that was very good). β€’ Repeated Letters: that was soooooo greeeat (that was so great). β€’ Alphanumeric Words: 2night(tonight), str8(straight). 32
  • 33. Sentiment Analysis is hard (2)! Entity Disambiguation ? 33
  • 34. Sentiment Analysis is hard (3)! Manage Comparative β€’ β€˜Federer is better than Nadal’ Federer (+) Nadal (-) Domain Dependent Opinion β€’ β€˜The battery life is long’ (+) β€’ β€˜The waiting time to enter at restaurant was too long’ (-) More Challenges β€’ Opinion Spam β€’ Sarcasm β€’ More general complexity of natural language β€’ … 34
  • 35. Sentiment Analysis is hard (4)!  A company posted an ad for writing fake reviews on amazon.com (65 cents per review) 35
  • 36. Sentiment Analysis: Known Approaches Building opinion words lexicon β€’ Lexical Methods – Manual approach – Dictionary-based approach (Hu and Liu, 2004, Andreevskaia and Bergler, 2006, Dragut et al 2010) – Corpus-based approach (Hazivassiloglou and McKeown, 1997; Turney, 2002; Yu and Hazivassiloglou, 2003; Kanayama and Nasukawa, 2006; Ding, Liu and Yu, 2008) β€’ Machine Learning – Unsupervised learning (Hatzivassiloglou and McKeown 1997, Yu and Hatzivassiloglou 2003) – Supervised learning (Alec Go et al 2009, Pang – Lee 2002, 2010 Pak – Paroubek) – Semi-supervised learning (Andreevskaia and Bergler, 2006 , Esuti and Sebastiani, 2005 ) 36
  • 37. Sentiment Analysis: Known Approaches Building opinion words lexicon β€’ Manual approach β€’ Pro: precision, no rules to define β€’ Cons: no automation, time for set up lexicon β€’ Dictionary-based approach β€’ Manual or prepared dictionary of positive – negative words. Expand dictionary with synonyms and antonyms. β€’ Pro: faster, semi-automated β€’ Cons: low precision (synonyms: great -> excellent and admirable but also -> large; big; fat) β€’ Corpus-based approach β€’ Seed set of positive – negative adjective (for example) β€’ Expand this set using grammar bindings β€’ Example: β€˜this car is beautiful and spacious’ ; if is known that beautiful is positive also spacious is positive. β€’ Pro: high automation, moderate precision 37 β€’ Cons: attention to grammar rules, word set isn’t complete
  • 38. Outline β€’ Social Networks and Web 2.0 β€’ Sentiment analysis: what is it? β€’ Sentiment analysis: applications β€’ S.A. an inside look οƒΌ TwitterSA β€’ TwitterSA Soccer Match Analysis β€’ TwitterSA Predicting Elections 38
  • 39. Twitter: The Social Network β€’ 140 char max status length β€’ Can add urls with multimedia β€’ 99% are public status β€’ No friend: followers and following β€’ Hashtag # 39
  • 40. TwitterSA 40
  • 41. TwitterSA: Machine Learning Goal: Classify text input in Positive or Negative Supervised Algorithm β€’ Must provide a set of inputs (Text phrase) and the appropriate outputs class (Positive or Negative) for those inputs. β€’ Learning algorithm will train using those inputs. After that is able to classify a new instance. 41
  • 42. TwitterSA: Multinomial Naive Bayes Naive Bayes Theorem X = new text instance to classify π‘ͺ 𝟏 . . π‘ͺ 𝒏 = possible class (Ex. Positive, Negative..) 𝑷(𝑿|π‘ͺ π’Š ) = products of probabilities that single attributes of istance X appertein to class 𝐢 𝑖 𝑷(π‘ͺ π’Š |𝑿) = probability that new instance X appartein to class 𝐢 𝑖 X X P(C|X) 42
  • 43. Multinomial Naive Bayes: A Worked Example Doc Words vector Class (𝐢 𝑖 ) Training 1 Love C1 = Pos 2 Almost hate C2 = Neg 3 Love C1 = Pos Test 4 Almost Love ? 43
  • 44. Multinomial Naive Bayes: A Worked Example Doc Words vector Class (𝐢 𝑖 ) Training 1 Love C1 = Pos 𝑁𝑐 𝑉 = 3 π‘€π‘œπ‘Ÿπ‘‘π‘  2 Almost hate C2 = Neg 𝑃 𝐢𝑖 = 𝑁 3 Love C1 = Pos π‘π‘œπ‘’π‘›π‘‘ 𝑋, 𝐢 𝑖 + 1 Test 4 Almost Love ? 𝑃 𝑋 | 𝐢𝑖 = π‘π‘œπ‘’π‘›π‘‘ 𝐢 𝑖 + |𝑉| 2 1 𝑷 π‘ͺ𝟏 = 𝒑𝒐𝒔 = 𝑷 π‘ͺ𝟐 = π’π’†π’ˆ = 3 3 44
  • 45. Multinomial Naive Bayes: A Worked Example Doc Words vector Class (𝐢 𝑖 ) Training 1 Love C1 = Pos 𝑁𝑐 𝑉 = 3 π‘€π‘œπ‘Ÿπ‘‘π‘  2 Almost hate C2 = Neg 𝑃 𝐢𝑖 = 𝑁 3 Love C1 = Pos π‘π‘œπ‘’π‘›π‘‘ 𝑋, 𝐢 𝑖 + 1 Test 4 Almost Love ? 𝑃 𝑋 | 𝐢𝑖 = π‘π‘œπ‘’π‘›π‘‘ 𝐢 𝑖 + |𝑉| 2 1 𝑷 π‘ͺ𝟏 = 𝒑𝒐𝒔 = 𝑷 π‘ͺ𝟐 = π’π’†π’ˆ = 3 3 Conditional Probabilities 2+1 𝑃 πΏπ‘œπ‘£π‘’ π‘ƒπ‘œπ‘  ) = (2+3) = 3/5 0+1 𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘ π‘ƒπ‘œπ‘  ) = = 1/5 (2+3) 0+1 𝑃 πΏπ‘œπ‘£π‘’ 𝑁𝑒𝑔 ) = (2+3) = 1/5 1+1 45 𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘ 𝑁𝑒𝑔 ) = (2+3) = 2/5
  • 46. Multinomial Naive Bayes: A Worked Example Doc Words vector Class (𝐢 𝑖 ) Training 1 Love C1 = Pos 𝑁𝑐 𝑉 = 3 π‘€π‘œπ‘Ÿπ‘‘π‘  2 Almost hate C2 = Neg 𝑃 𝐢𝑖 = 𝑁 3 Love C1 = Pos π‘π‘œπ‘’π‘›π‘‘ 𝑋, 𝐢 𝑖 + 1 Test 4 Almost Love ? 𝑃 𝑋 | 𝐢𝑖 = π‘π‘œπ‘’π‘›π‘‘ 𝐢 𝑖 + |𝑉| 2 1 𝑷 π‘ͺ𝟏 = 𝒑𝒐𝒔 = 𝑷 π‘ͺ𝟐 = π’π’†π’ˆ = 3 3 Conditional Probabilities 2+1 𝑃 πΏπ‘œπ‘£π‘’ π‘ƒπ‘œπ‘  ) = (2+3) = 3/5 Choosing a class 0+1 𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘ π‘ƒπ‘œπ‘  ) = = 1/5 (2+3) 𝑃 π‘π‘œπ‘  π‘‘π‘œπ‘4) = 2/3 * 1/5 * 3/5 = 0,08 0+1 𝑃 πΏπ‘œπ‘£π‘’ 𝑁𝑒𝑔 ) = (2+3) = 1/5 𝑃 𝑛𝑒𝑔 π‘‘π‘œπ‘4) = 1/3 * 1/5 * 2/5 = 0,026 1+1 46 𝑃 π΄π‘™π‘šπ‘œπ‘ π‘‘ 𝑁𝑒𝑔 ) = (2+3) = 2/5
  • 47. Multinomial Naive Bayes: A Worked Example Input corpus: attributes and classes Training weights 47
  • 48. TwitterSA: Collecting Corpus β€’ Big corpus for training with label annotation!! β€’ Different methods from corpus-based or dictionary-based approach β€’ Collecting big sentiment corpus starting from noise label – Bag of words for training bayesian learning algorithm – Found that iPhone and :) can contain positive sentiment and :( the contrary – Discovered that also hashtags can be used as noise label o β€˜Recently I've started developing a love for indie music ... #loveit’ o β€˜I have to say, I am so impressed with this iPhone5. I will never ever go back to a Droid. #loveit #happy’ 48
  • 49. TwitterSA: Text Processing and Normalization TwitterSA process: many modules Normalization of repeated letters and alphanumeric Discard terms with high Entropy and low Salience Manage negation for sentiment training Convert slang words to normal form Unigram, Bigram for training 49
  • 50. TwitterSA: Vector of Feature (1) LIWC Dictionary MPQA Dictionary Linguistic categories Input: β€˜Happy Birthday Steve Jobs your iPhone is amazing’ {Pos, Neg} LIWC Categories {1,0} {posEmo, affect, ..} 50
  • 51. TwitterSA: Vector of Feature (2) POS Tag Description Example CC conjunction and, but, or, & POS Tagger CD cardinal number 1, three Input: He is the best DT determiner the Output: He|PRP is|VBP the|DT best|JJS JJ adjective green JJR adjective, comparative greener JJS adjective, superlative greenest … … … 1 0,8 0,6 Negative Sentence Positive Sentence Personal pronouns and possessive Adjective and superlative adverb. 0,4 Comparative adjective Proper Noun Verbs in past tense 0,2 0 NNS JJ NNP JJR MD CD WP POS FW PRP TO CC -LRB- RP NN PDT RBR JJS RBS -RRB- PRP$ DT RB IN WDT UH WRB NNPS VBZ VBP VB VBG VBN VBD -0,2 -0,4 -0,6 -0,8 Tag occurrence in positive and negative sentence 51 -1
  • 52. TwitterSA: Vector of Feature (3) Pattern Mask β€˜The combination of one or more near tag’ β€’ Input β€’ He|PRP is|VBP the|DT best|JJS β€’ Example output β€’ PRP|VBP ; PRP|VBP|DT ; PRP|VBP|DT|JJS ; etc. β€’ Discover most frequency pattern mask in positive and negative sentence. {Pos, Neg} For example an input PRP|VBP|DT|JJS occurs almost in positive sentence {1,0} 52
  • 53. TwitterSA: How Much is Accurate β€’ N-Fold Cross Validation (average results) β€’ Split corpus: 70% Training; 30% Test β€’ Manual corpus for testing 53
  • 54. TwitterSA: Testing Classification problem (not Retrieval) Precision: is positive predictive value, or correctly classified instance Recall: or Sensitivity is the proportion of actual positives which are correctly identified as such Confusion Matrix Classified Classified Positive Negative Predicted TP FN Positive Predicted FP TN Negative 54
  • 55. Outline β€’ Social Networks and Web 2.0 β€’ Sentiment analysis: what is it? β€’ Sentiment analysis: applications β€’ S.A. an inside look β€’ TwitterSA οƒΌ TwitterSA Soccer Match Analysis β€’ TwitterSA Predicting Elections 55
  • 56. TwitterSA: Soccer Match Analysis What Monitor soccer match on Twitter Milan – Inter, Seria A Season 2011-2012 Goal β€’ Understand with automatic sentiment analysis the behaviour of the match β€’ Who wins? How many goals? 56
  • 57. TwitterSA: Soccer Match Analysis Volume of tweets Inter and Milan 1600 1400 Match Start 1200 1000 800 Tweets 600 400 200 0 Mon Fri Sun Mon 57
  • 58. TwitterSA: Soccer Match Analysis Volume of tweets Inter and Milan 1600 1400 Match Start Coaches Interview 1200 1000 800 Tweets 600 400 200 0 Mon Fri Sun Mon 58
  • 59. TwitterSA: Soccer Match Analysis Volume of tweets Inter and Milan 1600 1400 Match Start Goal: which team? Coaches Interview 1200 1000 800 Tweets 600 400 200 0 Mon Fri Sun Mon 59
  • 60. TwitterSA: Soccer Match Analysis Sentiment Analysis 90 Goal: which team? 85 80 % Positive Tweets 75 70 Milan pos Inter pos 65 60 55 50 match day before tweet peak match end the day after timeline 60
  • 61. TwitterSA: Soccer Match Analysis β€’ Who won? β€’ How many goals? 61
  • 62. TwitterSA: Soccer Match Analysis β€’ Who won? β€’ How many goals? 62
  • 63. Outline β€’ Social Networks and Web 2.0 β€’ Sentiment analysis: what is it? β€’ Sentiment analysis: applications β€’ S.A. an inside look β€’ TwitterSA β€’ TwitterSA Soccer Match Analysis οƒΌ TwitterSA Predicting Elections 63
  • 70. TwitterSA: Predicting Elections Results of elections Full article and infographics at http://davidefeltoni.wordpress.com 70
  • 71. References β€’ Dr. Diana Maynard: Practical Sentiment Analysis β€’ Seth Grimes: Sentiment Analysis Symposium 2012 β€’ B. Liu reference is available here: http://www.cs.uic.edu/~liub/FBS/AAAI-2011-tutorial-references.pdf β€’ Best Survey about Sentiment Analysis: B. Liu β€˜Sentiment Analysis and Subjectivity’ chapter is available here: http://www.cs.uic.edu/~liub/FBS/NLP-handbook-sentiment- analysis.pdf β€’ Great tutorial for Sentiment Analysis: http://sentiment.christopherpotts.net/ β€’ Some images and statistics are taken from www.basistech.com, www.nielsen.com 71
  • 72. β€˜Quando in codesto sentire ti senti veramente felice, chiamalo pure come vuoi: chiamalo felicitΓ , cuore, amore. Per questo io non ho nome alcuno. Sentimento Γ¨ tutto! La parola Γ¨ soltanto suono e fumo.’’ Johann Wolfgang von Goethe 72