SlideShare a Scribd company logo
1 of 3
Download to read offline
Sarcasm Detection
David Boyhan
CS-410
December 9, 2016
University of Illinois -
Urbana Champaign
MCS – DSO
Dboyhan2@illinois.edu
ABSTRACT
In this paper, I describe efforts to use natural language processing
techniques to discern sarcastic comments from non-sarcastic
comments
Keywords
Sentiment Analysis, Sarcasm, Natural Language Processing
1. INTRODUCTION
Sentiment analysis is a branch of natural language processing
(“NLP”) intended to identify the “polarity” (positive or negative)
of a comment or communication. This is useful in situations such
as customer support and product review analysis to determine if
there is positive or negative feedback being submitted. However,
sentiment analysis is frequently stymied by sarcastic comments.
The project is an attempt to use different NLP techniques to
discriminate sarcastic from non-sarcastic comments.
2. DEFINITIONS
Sarcasm is defined by the Oxford English Dictionary as “A sharp,
bitter, or cutting expression or remark; a bitter gibe or taunt.” [1]
However, in more common usage, sarcasm is typically conflated
with an intentionally ironic statement. [2] In other words, sarcasm
will most typically be a statement that is intentionally the exact
opposite of the literal meaning of the statement. For example, “I
love waiting on line” or “I hate receiving thoughtful gifts.” In
each case, the communicator or speaker obviously means the
opposite of the actual statement.
In NLP terms, sarcasm can be defined as false positive or false
negative sentiment classification. That is, using conventional
sentiment analysis would yield the wrong result for a sarcastic
comment.
3. CREATING OR FINDING A CORPUS
3.1 Web-based Tagging
The first part of this project, as proposed, was to develop a web-
based tool for marking sarcastic comments from a browser (such
as tweets or user comments) and then, using a JavaScript
bookmarklet or a Chrome extension, tagging the highlighted text
and automatically adding it to a “sarcasm corpus.” Unfortunately,
this portion of the project was not at all successful. JavaScript in
particular and web-browsers in general are specifically designed
to isolate the web from the native OS. As a result, although it is
relatively easy to use JavaScript to manipulate highlighted text
internally, including copying into the OS’ clipboard, it is
extremely difficult to then take that text and write it to a local file.
This would be a significant security risk and although it would be
possible to develop work-arounds, such as OS specific apps that
automatically “sweep” clipboard contents into a local file, this
would no longer be web-based only or OS agnostic and also
seemed to be significantly beyond the scope of CS410. As a
result, this portion of the project was suspended.
3.2 Pre-Tagged Sarcasm Corpora
In the absence of a tool and resources to develop a large tagged
sarcasm corpora, it was necessary to find pre-tagged corpora
elsewhere. Two resources in particular were identified and
reviewed. The first was developed at Fordham University, [3] and
was developed by using sentiment analysis on Amazon product
reviews together with the number of stars assigned to the product
in the review. In simplest terms, the corpus identified ironic or
sarcastic comments by cross-indexing positive polarity to fewer
stars. [4] My concern in using this corpus was that in reviewing
samples of the corpus, it did not appear that there were many
features that would help develop a sarcasm model beyond the star
ratings.
Accordingly, I identified a second tagged sarcasm corpus, which I
subsequently selected for analysis. This second corpus was
developed at the University of Santa Cruz [5] and included
quotations and responses mined from political debates on multiple
internet forums and tagged through Amazon’s Mechanical Turk
into “sarcastic” and “not-sarcastic” categories. There were
approximately 5,000 tagged comment/response pairs, split
roughly 50/50 between sarcastic and non-sarcastic.
4. PROCESSING & ANALYSIS
In order to build an NLP model using the tagged corpus, I used
the WEKA 3 Data Mining toolset [6]. WEKA has a number of
NLP tools built in and has a package management tool that allows
the addition of other algorithms, such as SVM.
In order to use WEKA, the corpus needed to be converted from its
tagged CSV format into WEKA custom arff format. As a part of
that conversion, I prepared different sub-corpora to work with:
Quote and Response – Stop Words Removed
Quote and Response – Stop Words Retained
Quote Only – Stop Words Removed
Response Only – Stop Words Retained
I then applied multiple NLP categorization algorithms to the
different corpora, including naïve Bayes, k-nearest neighbor
(“KNN”) and support vector machines (“SVM”). Unfortunately,
as shown below, the results were significantly poorer than I
anticipated:
Naive Bayes
Correctly
Classified
2424 51.66%
Incorrectly
Classified
2268 48.34%
True
Positive
False
Positive
Precision Recall F-Measure
Sarcastic 0.613 0.58 0.514 0.613 0.559
Not
Sarcastic
0.42 0.387 0.521 0.42 0.465
Weighted
Avg.
0.517 0.483 0.517 0.517 0.512
KNN
Correctly
Classified
2431 51.81%
Incorrectly
Classified
2261 48.19%
True
Positive
False
Positive
Precision Recall F-Measure
Sarcastic 0.422 0.385 0.522 0.422 0.467
Not
Sarcastic
0.615 0.578 0.515 0.615 0.561
Weighted
Avg.
0.518 0.482 0.519 0.518 0.514
SVM
Correctly
Classified
2356 50.21%
Incorrectly
Classified
2336 49.79%
True
Positive
False
Positive
Precision Recall F-Measure
Sarcastic 0.405 0.401 0.503 0.405 0.449
Not
Sarcastic
0.599 0.595 0.502 0.599 0.546
Weighted
Avg.
0.502 0.498 0.502 0.502 0.497
Clearly, based on both Precision and F-Measure, results were
extremely mediocre, bordering on little better than coin flipping.
What was particularly surprising about these results was that use
of presumably more “reliable” algorithms, such as KNN and
SVM actually yielded poorer results that algorithms considered
less robust, such as naïve Bayes.
These results were consistent across all four corporate corpora.
Further attempts to improve performance, such as retaining or
eliminating punctuation, layering models and similar efforts were
equally disappointing.
5. BARRIERS TO RECOGNIZING
SARCASM
As a result of the poor results using various algorithms, I began
researching in more detail the issues associated with sarcasm
recognition.
5.1 Humans Are Very Bad at Recognizing
Sarcasm
A significant body of research supports the assertion that human
beings are almost as bad at identifying written sarcasm as the
models discussed above. Anecdotally, there are two separate
“wiki-how” entries on identifying sarcasm [7] and in 2014, the
U.S. Secret Service put in a request for software that would
identify sarcasm in communications. [8] In a 2005 study, only
56% of participants correctly identified sarcastic vs. non-sarcastic
comments when sent as a message. [9]
In a 2006 survey, 55% of respondents incorrectly believed they
were providing an example of a sarcastic comment. [10]
The 2005 study found that when the same messages were
transmitted through a voice recording, the recipient interpreted the
emotion correctly 73% of the time (consistent with senders’
expectations.) [11].
Two features in particular, are critical in correctly identifying
sarcasm in written communications: context and emphasis. [12]
5.2 Context
The identity, beliefs and demographics of the communicator are
essential to correctly identifying sarcasm.
For example, typically, a 10-year-old boy’s views on the latest
Transformers movie is likely to be very different from those of a
45-year-old woman’s. This would be essential in determining
whether the phrase “I can’t wait to spend my Saturday at the new
Transformers movie” is sarcastic or not.
Similarly, a positive comment about an entertainer posted on a
fan-site is far more likely to be genuine (non-sarcastic) than one
posted on a site devoted to critics of that entertainer. Capturing
this context may be simple for human beings. However, in the
NLP arena, with the exception of posting location, most, if not all
of the context associated with the communicator is hidden.
5.3 Emphasis
Emphasis (or, in the case of vocal communication, facial
expression or speech inflection) are similarly critical in
identifying sarcasm, especially in the absence of another context.
In electronic communications, the author will often attempt to
provide emphasis through the use of punctuation, spacing, font
formatting (capitalization, italics, bold, underlining) and non-
word emoticons (e.g., the ubiquitous colon-parenthesis side-ways
smiley-face). [13]
For example, compare the sentences “Oh, yes, he did an excellent
job.” And “Oh yes … he did an EXCELLENT job …” Although
the only difference is formatting and punctuation, the difference
in intent is obvious. In typical NLP pre-processing, most if not all
of this information is discarded.
6. Re-Examining the Corpus
Unfortunately, as discussed above, various attempts to improve
model performance, including retaining stop-words and limiting
the analysis to the initial comment or the response were
unsuccessful. There were also no apparent methods for adding
additional context or finding emphasis which had been lost.
Consequently, I felt it was appropriate to re-examine the tagged
corpus. Although I had performed an initial review and validation
of the corpus, I did not review a significant number of the tags.
The tags themselves were provided by Amazon Mechanical Turk
reviewers. I therefore reviewed approximately 10% of the overall
tags applied (500 tags). The results were surprising, but, perhaps
should not have been given research on human assessment of
written communications for sarcasm.
Of the 500 tags reviewed, I felt that 239 were incorrectly tagged.
For example, the following statement and response was tagged as
sarcastic:
Quote: “Still dumb to compare the two. Both are no
good. Theft is better then murder...however a theif
bragging because he didn't murder is a fool..Chris Rock
actually did a funny skit on this.”
Response: “So why did you say pot was worse than
alcohol? Just because both are apparently bad, doesn't
make one 'worse'. I'll point out that you compared the
two first, so saying it's dumb to compare the two's
shooting yourself in the foot.”
There does not appear to be any irony intended in either the initial
quote nor the response. Nor, in fact, does the exchange appear to
be particularly cutting or bitter.
Given that in a 10% sample, I found that approximately ½ of the
tags were subject to challenge, it is not surprising that none of the
NLP models used were any better. Further, the discrepancies were
completely consistent with the 2005 and 2006 studies discussed
above.
7. NEXT STEPS & CONCLUSION
In order to perform a more accurate analysis, there is a need for
either a more consistently tagged corpus or access to additional
context and emphasis information. Clearly, sarcasm identification
is highly subjective and is unlikely to be improved unless the
human classifiers have a more formal background in rhetoric.
Accordingly, it may be worthwhile exploring use of selected
“expert groups” in tagging documents rather than a broader group
such as the Mechanical Turk. A more difficult issue is one of
identifying communicator context and retaining written emphasis.
Unformatted “text only” services such as twitter will limit font
formatting but at least permit the expanded emoji set and
punctuation based formatting. Further, there may be significant
information about the individual communicator available through
their social media profile. Accordingly, a more realistic “next
step” would be to build a corpus that first creates a demographic
profile on a twitter user (or similar social media commentator)
and then associates tweets and comments with that person for
categorization. With both pieces of data, identification of sarcasm
may be significantly simpler. However, as noted above, using
comments in a vacuum without any context appears to be
comparable to or worse than human assessment and will likely
remain so.
8. REFERENCES
[1] Oxford English Dictionary (entry for “Sarcasm”)
http://www.oed.com/viewdictionaryentry/Entry/170938
[2] “Irony” - “The expression of one's meaning by using
language that normally signifies the opposite, typically for
humorous or emphatic effect; esp. (in earlier use) the use of
approbatory language to imply condemnation or contempt
(cf. sarcasm n.).” Oxford English Dictionary, (Online)
http://www.oed.com/view/Entry/99565#eid64994
[3] Sarcasm Corpus – Review of Amazon Products,
http://storm.cis.fordham.edu/~filatova/SarcasmCorpus.html
[4] Filatova, Elena, Irony and Sarcasm: Corpus Generation and
Analysis Using Crowdsourcing,
https://pdfs.semanticscholar.org/830f/d0969fbc1e5f2e112aa9
a948b50216150c0a.pdf
[5] Sarcasm Corpus – V2, https://nlds.soe.ucsc.edu/sarcasm2
[6] Weka 3: Data Mining in Java,
http://www.cs.waikato.ac.nz/ml/weka/
[7] http://www.wikihow.com/Detect-Sarcasm-in-Writing and
http://www.wikihow.com/Tell-if-Someone-Is-Being-
Sarcastic
[8] Zezima, Katie, “The Fix: The Secret Service wants software
that detects social media sarcasm. Yeah, sure it will work.”,
The Washington Post, June 3, 2014
[9] Kruger, Justin; Epley, Nicholas; Parker, Jason; Ng, Zhi-Wen,
Egocentrism over e-mail: Can we communicate as well as
we think?, Journal of Personality and Social Psychology, Vol
89(6), Dec 2005, 925-936 http://dx.doi.org/10.1037/0022-
3514.89.6.925
[10] Galinsky, A. D., Magee, J. C., Inesi, M. E., & Gruenfeld, D.
H. (2006). Power and perspectives not taken. Psychological
Science, 17(12), 1068-1074. doi: 10.1111/j.1467-
9280.2006.01824.x
(http://pss.sagepub.com/content/17/12/1068.full?oauth-
code=M6R720OaLlROh7BdyvSikpQgSwrU8ZZi)
[11] Kruger, et.al.
[12] Riordan, M. A., & Trichtinger, L. A. (2016). Overconfidence
at the Keyboard: Confidence and accuracy in interpreting
affect in e‐mail exchanges. Human Communication
Research. doi: 10.1111/hcre.12093
[13] Thompson, D., & Filik, R. (2016). Sarcasm in written
communication: Emoticons are efficient markers of
intention. Journal of Computer‐Mediated Communication,
21(2), 105-120. doi: 10.1111/jcc4.12156

More Related Content

What's hot

Ensemble learning
Ensemble learningEnsemble learning
Ensemble learningHaris Jamil
 
Detecting the presence of cyberbullying using computer software
Detecting the presence of cyberbullying using computer softwareDetecting the presence of cyberbullying using computer software
Detecting the presence of cyberbullying using computer softwareAshish Arora
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining Sulman Ahmed
 
Naive bayes
Naive bayesNaive bayes
Naive bayesumeskath
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning Gopal Sakarkar
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset PreparationAndrew Ferlitsch
 
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...Anmol Bhasin
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingPranav Gupta
 
Project sentiment analysis
Project sentiment analysisProject sentiment analysis
Project sentiment analysisBob Prieto
 
Machine Learning project presentation
Machine Learning project presentationMachine Learning project presentation
Machine Learning project presentationRamandeep Kaur Bagri
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression TreesHemant Chetwani
 
Random forest algorithm
Random forest algorithmRandom forest algorithm
Random forest algorithmRashid Ansari
 
Recommendation system
Recommendation system Recommendation system
Recommendation system Vikrant Arya
 
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...Simplilearn
 
Fake news detection project
Fake news detection projectFake news detection project
Fake news detection projectHarshdaGhai
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsMd. Main Uddin Rony
 

What's hot (20)

Ensemble learning
Ensemble learningEnsemble learning
Ensemble learning
 
Learning from imbalanced data
Learning from imbalanced data Learning from imbalanced data
Learning from imbalanced data
 
Detecting the presence of cyberbullying using computer software
Detecting the presence of cyberbullying using computer softwareDetecting the presence of cyberbullying using computer software
Detecting the presence of cyberbullying using computer software
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining
 
Naive bayes
Naive bayesNaive bayes
Naive bayes
 
FAKE NEWS DETECTION PPT
FAKE NEWS DETECTION PPT FAKE NEWS DETECTION PPT
FAKE NEWS DETECTION PPT
 
Phd thesis final presentation
Phd thesis   final presentationPhd thesis   final presentation
Phd thesis final presentation
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset Preparation
 
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Project sentiment analysis
Project sentiment analysisProject sentiment analysis
Project sentiment analysis
 
Machine Learning project presentation
Machine Learning project presentationMachine Learning project presentation
Machine Learning project presentation
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
 
Random forest algorithm
Random forest algorithmRandom forest algorithm
Random forest algorithm
 
Recommendation system
Recommendation system Recommendation system
Recommendation system
 
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
 
Fake news detection project
Fake news detection projectFake news detection project
Fake news detection project
 
Session-Based Recommender Systems
Session-Based Recommender SystemsSession-Based Recommender Systems
Session-Based Recommender Systems
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
 

Similar to Sarcasm Detection

Comparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentimentComparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentimentAI Publications
 
Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...redpel dot com
 
Effectiveness of Bug Bounty Program in preventing security v
Effectiveness of Bug Bounty Program in preventing security vEffectiveness of Bug Bounty Program in preventing security v
Effectiveness of Bug Bounty Program in preventing security vEvonCanales257
 
intro to sentementel analysis
intro to sentementel analysisintro to sentementel analysis
intro to sentementel analysisbhardwaj86
 
Intro to Sentiment Analysis
Intro to Sentiment AnalysisIntro to Sentiment Analysis
Intro to Sentiment AnalysisBernard Goldbach
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papersAshish Kulkarni
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmIJSRD
 
Detection of Fraud Reviews for a Product
Detection of Fraud Reviews for a ProductDetection of Fraud Reviews for a Product
Detection of Fraud Reviews for a ProductIJSRD
 
Solving a business problem through text and sentiment mining
Solving a business problem through text and sentiment miningSolving a business problem through text and sentiment mining
Solving a business problem through text and sentiment miningONKAR JADHAV
 
Digital QA Summit - Karen N. Johnson - Testing Explained
Digital QA Summit - Karen N. Johnson - Testing ExplainedDigital QA Summit - Karen N. Johnson - Testing Explained
Digital QA Summit - Karen N. Johnson - Testing ExplainedKaren N. Johnson
 
Testing Taxonomies: Beyond Card Sorting
Testing Taxonomies: Beyond Card SortingTesting Taxonomies: Beyond Card Sorting
Testing Taxonomies: Beyond Card SortingAlberta Soranzo
 
College-Level Writing RUBRIC Criteria
College-Level Writing RUBRIC  CriteriaCollege-Level Writing RUBRIC  Criteria
College-Level Writing RUBRIC CriteriaLynellBull52
 
Recommendations and Statistics with Graph Databases
Recommendations and Statistics with Graph DatabasesRecommendations and Statistics with Graph Databases
Recommendations and Statistics with Graph DatabasesCalin Constantinov
 
Onw14 snowstormedition notes
Onw14 snowstormedition notesOnw14 snowstormedition notes
Onw14 snowstormedition notesRick Stoddart
 
Metric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleMetric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleSteve Karam
 

Similar to Sarcasm Detection (20)

sent_analysis_report
sent_analysis_reportsent_analysis_report
sent_analysis_report
 
Comparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentimentComparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentiment
 
Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...
 
Effectiveness of Bug Bounty Program in preventing security v
Effectiveness of Bug Bounty Program in preventing security vEffectiveness of Bug Bounty Program in preventing security v
Effectiveness of Bug Bounty Program in preventing security v
 
Sentiment analysis on_unstructured_review-1
Sentiment analysis on_unstructured_review-1Sentiment analysis on_unstructured_review-1
Sentiment analysis on_unstructured_review-1
 
Sentiment analysis on unstructured review
Sentiment analysis on unstructured reviewSentiment analysis on unstructured review
Sentiment analysis on unstructured review
 
intro to sentementel analysis
intro to sentementel analysisintro to sentementel analysis
intro to sentementel analysis
 
Intro to Sentiment Analysis
Intro to Sentiment AnalysisIntro to Sentiment Analysis
Intro to Sentiment Analysis
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papers
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
 
Detection of Fraud Reviews for a Product
Detection of Fraud Reviews for a ProductDetection of Fraud Reviews for a Product
Detection of Fraud Reviews for a Product
 
Solving a business problem through text and sentiment mining
Solving a business problem through text and sentiment miningSolving a business problem through text and sentiment mining
Solving a business problem through text and sentiment mining
 
Digital QA Summit - Karen N. Johnson - Testing Explained
Digital QA Summit - Karen N. Johnson - Testing ExplainedDigital QA Summit - Karen N. Johnson - Testing Explained
Digital QA Summit - Karen N. Johnson - Testing Explained
 
Testing Taxonomies: Beyond Card Sorting
Testing Taxonomies: Beyond Card SortingTesting Taxonomies: Beyond Card Sorting
Testing Taxonomies: Beyond Card Sorting
 
Fyp ca2
Fyp ca2Fyp ca2
Fyp ca2
 
College-Level Writing RUBRIC Criteria
College-Level Writing RUBRIC  CriteriaCollege-Level Writing RUBRIC  Criteria
College-Level Writing RUBRIC Criteria
 
Recommendations and Statistics with Graph Databases
Recommendations and Statistics with Graph DatabasesRecommendations and Statistics with Graph Databases
Recommendations and Statistics with Graph Databases
 
Onw14 snowstormedition notes
Onw14 snowstormedition notesOnw14 snowstormedition notes
Onw14 snowstormedition notes
 
Metric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleMetric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in Oracle
 
ACL-IJCNLP 2015
ACL-IJCNLP 2015ACL-IJCNLP 2015
ACL-IJCNLP 2015
 

Recently uploaded

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

Sarcasm Detection

  • 1. Sarcasm Detection David Boyhan CS-410 December 9, 2016 University of Illinois - Urbana Champaign MCS – DSO Dboyhan2@illinois.edu ABSTRACT In this paper, I describe efforts to use natural language processing techniques to discern sarcastic comments from non-sarcastic comments Keywords Sentiment Analysis, Sarcasm, Natural Language Processing 1. INTRODUCTION Sentiment analysis is a branch of natural language processing (“NLP”) intended to identify the “polarity” (positive or negative) of a comment or communication. This is useful in situations such as customer support and product review analysis to determine if there is positive or negative feedback being submitted. However, sentiment analysis is frequently stymied by sarcastic comments. The project is an attempt to use different NLP techniques to discriminate sarcastic from non-sarcastic comments. 2. DEFINITIONS Sarcasm is defined by the Oxford English Dictionary as “A sharp, bitter, or cutting expression or remark; a bitter gibe or taunt.” [1] However, in more common usage, sarcasm is typically conflated with an intentionally ironic statement. [2] In other words, sarcasm will most typically be a statement that is intentionally the exact opposite of the literal meaning of the statement. For example, “I love waiting on line” or “I hate receiving thoughtful gifts.” In each case, the communicator or speaker obviously means the opposite of the actual statement. In NLP terms, sarcasm can be defined as false positive or false negative sentiment classification. That is, using conventional sentiment analysis would yield the wrong result for a sarcastic comment. 3. CREATING OR FINDING A CORPUS 3.1 Web-based Tagging The first part of this project, as proposed, was to develop a web- based tool for marking sarcastic comments from a browser (such as tweets or user comments) and then, using a JavaScript bookmarklet or a Chrome extension, tagging the highlighted text and automatically adding it to a “sarcasm corpus.” Unfortunately, this portion of the project was not at all successful. JavaScript in particular and web-browsers in general are specifically designed to isolate the web from the native OS. As a result, although it is relatively easy to use JavaScript to manipulate highlighted text internally, including copying into the OS’ clipboard, it is extremely difficult to then take that text and write it to a local file. This would be a significant security risk and although it would be possible to develop work-arounds, such as OS specific apps that automatically “sweep” clipboard contents into a local file, this would no longer be web-based only or OS agnostic and also seemed to be significantly beyond the scope of CS410. As a result, this portion of the project was suspended. 3.2 Pre-Tagged Sarcasm Corpora In the absence of a tool and resources to develop a large tagged sarcasm corpora, it was necessary to find pre-tagged corpora elsewhere. Two resources in particular were identified and reviewed. The first was developed at Fordham University, [3] and was developed by using sentiment analysis on Amazon product reviews together with the number of stars assigned to the product in the review. In simplest terms, the corpus identified ironic or sarcastic comments by cross-indexing positive polarity to fewer stars. [4] My concern in using this corpus was that in reviewing samples of the corpus, it did not appear that there were many features that would help develop a sarcasm model beyond the star ratings. Accordingly, I identified a second tagged sarcasm corpus, which I subsequently selected for analysis. This second corpus was developed at the University of Santa Cruz [5] and included quotations and responses mined from political debates on multiple internet forums and tagged through Amazon’s Mechanical Turk into “sarcastic” and “not-sarcastic” categories. There were approximately 5,000 tagged comment/response pairs, split roughly 50/50 between sarcastic and non-sarcastic. 4. PROCESSING & ANALYSIS In order to build an NLP model using the tagged corpus, I used the WEKA 3 Data Mining toolset [6]. WEKA has a number of NLP tools built in and has a package management tool that allows the addition of other algorithms, such as SVM. In order to use WEKA, the corpus needed to be converted from its tagged CSV format into WEKA custom arff format. As a part of that conversion, I prepared different sub-corpora to work with: Quote and Response – Stop Words Removed Quote and Response – Stop Words Retained Quote Only – Stop Words Removed Response Only – Stop Words Retained I then applied multiple NLP categorization algorithms to the different corpora, including naïve Bayes, k-nearest neighbor (“KNN”) and support vector machines (“SVM”). Unfortunately,
  • 2. as shown below, the results were significantly poorer than I anticipated: Naive Bayes Correctly Classified 2424 51.66% Incorrectly Classified 2268 48.34% True Positive False Positive Precision Recall F-Measure Sarcastic 0.613 0.58 0.514 0.613 0.559 Not Sarcastic 0.42 0.387 0.521 0.42 0.465 Weighted Avg. 0.517 0.483 0.517 0.517 0.512 KNN Correctly Classified 2431 51.81% Incorrectly Classified 2261 48.19% True Positive False Positive Precision Recall F-Measure Sarcastic 0.422 0.385 0.522 0.422 0.467 Not Sarcastic 0.615 0.578 0.515 0.615 0.561 Weighted Avg. 0.518 0.482 0.519 0.518 0.514 SVM Correctly Classified 2356 50.21% Incorrectly Classified 2336 49.79% True Positive False Positive Precision Recall F-Measure Sarcastic 0.405 0.401 0.503 0.405 0.449 Not Sarcastic 0.599 0.595 0.502 0.599 0.546 Weighted Avg. 0.502 0.498 0.502 0.502 0.497 Clearly, based on both Precision and F-Measure, results were extremely mediocre, bordering on little better than coin flipping. What was particularly surprising about these results was that use of presumably more “reliable” algorithms, such as KNN and SVM actually yielded poorer results that algorithms considered less robust, such as naïve Bayes. These results were consistent across all four corporate corpora. Further attempts to improve performance, such as retaining or eliminating punctuation, layering models and similar efforts were equally disappointing. 5. BARRIERS TO RECOGNIZING SARCASM As a result of the poor results using various algorithms, I began researching in more detail the issues associated with sarcasm recognition. 5.1 Humans Are Very Bad at Recognizing Sarcasm A significant body of research supports the assertion that human beings are almost as bad at identifying written sarcasm as the models discussed above. Anecdotally, there are two separate “wiki-how” entries on identifying sarcasm [7] and in 2014, the U.S. Secret Service put in a request for software that would identify sarcasm in communications. [8] In a 2005 study, only 56% of participants correctly identified sarcastic vs. non-sarcastic comments when sent as a message. [9] In a 2006 survey, 55% of respondents incorrectly believed they were providing an example of a sarcastic comment. [10] The 2005 study found that when the same messages were transmitted through a voice recording, the recipient interpreted the emotion correctly 73% of the time (consistent with senders’ expectations.) [11]. Two features in particular, are critical in correctly identifying sarcasm in written communications: context and emphasis. [12] 5.2 Context The identity, beliefs and demographics of the communicator are essential to correctly identifying sarcasm. For example, typically, a 10-year-old boy’s views on the latest Transformers movie is likely to be very different from those of a 45-year-old woman’s. This would be essential in determining whether the phrase “I can’t wait to spend my Saturday at the new Transformers movie” is sarcastic or not. Similarly, a positive comment about an entertainer posted on a fan-site is far more likely to be genuine (non-sarcastic) than one posted on a site devoted to critics of that entertainer. Capturing this context may be simple for human beings. However, in the NLP arena, with the exception of posting location, most, if not all of the context associated with the communicator is hidden. 5.3 Emphasis Emphasis (or, in the case of vocal communication, facial expression or speech inflection) are similarly critical in identifying sarcasm, especially in the absence of another context. In electronic communications, the author will often attempt to provide emphasis through the use of punctuation, spacing, font formatting (capitalization, italics, bold, underlining) and non- word emoticons (e.g., the ubiquitous colon-parenthesis side-ways smiley-face). [13] For example, compare the sentences “Oh, yes, he did an excellent job.” And “Oh yes … he did an EXCELLENT job …” Although the only difference is formatting and punctuation, the difference in intent is obvious. In typical NLP pre-processing, most if not all of this information is discarded. 6. Re-Examining the Corpus Unfortunately, as discussed above, various attempts to improve model performance, including retaining stop-words and limiting the analysis to the initial comment or the response were unsuccessful. There were also no apparent methods for adding additional context or finding emphasis which had been lost. Consequently, I felt it was appropriate to re-examine the tagged corpus. Although I had performed an initial review and validation of the corpus, I did not review a significant number of the tags. The tags themselves were provided by Amazon Mechanical Turk reviewers. I therefore reviewed approximately 10% of the overall tags applied (500 tags). The results were surprising, but, perhaps should not have been given research on human assessment of written communications for sarcasm. Of the 500 tags reviewed, I felt that 239 were incorrectly tagged. For example, the following statement and response was tagged as sarcastic:
  • 3. Quote: “Still dumb to compare the two. Both are no good. Theft is better then murder...however a theif bragging because he didn't murder is a fool..Chris Rock actually did a funny skit on this.” Response: “So why did you say pot was worse than alcohol? Just because both are apparently bad, doesn't make one 'worse'. I'll point out that you compared the two first, so saying it's dumb to compare the two's shooting yourself in the foot.” There does not appear to be any irony intended in either the initial quote nor the response. Nor, in fact, does the exchange appear to be particularly cutting or bitter. Given that in a 10% sample, I found that approximately ½ of the tags were subject to challenge, it is not surprising that none of the NLP models used were any better. Further, the discrepancies were completely consistent with the 2005 and 2006 studies discussed above. 7. NEXT STEPS & CONCLUSION In order to perform a more accurate analysis, there is a need for either a more consistently tagged corpus or access to additional context and emphasis information. Clearly, sarcasm identification is highly subjective and is unlikely to be improved unless the human classifiers have a more formal background in rhetoric. Accordingly, it may be worthwhile exploring use of selected “expert groups” in tagging documents rather than a broader group such as the Mechanical Turk. A more difficult issue is one of identifying communicator context and retaining written emphasis. Unformatted “text only” services such as twitter will limit font formatting but at least permit the expanded emoji set and punctuation based formatting. Further, there may be significant information about the individual communicator available through their social media profile. Accordingly, a more realistic “next step” would be to build a corpus that first creates a demographic profile on a twitter user (or similar social media commentator) and then associates tweets and comments with that person for categorization. With both pieces of data, identification of sarcasm may be significantly simpler. However, as noted above, using comments in a vacuum without any context appears to be comparable to or worse than human assessment and will likely remain so. 8. REFERENCES [1] Oxford English Dictionary (entry for “Sarcasm”) http://www.oed.com/viewdictionaryentry/Entry/170938 [2] “Irony” - “The expression of one's meaning by using language that normally signifies the opposite, typically for humorous or emphatic effect; esp. (in earlier use) the use of approbatory language to imply condemnation or contempt (cf. sarcasm n.).” Oxford English Dictionary, (Online) http://www.oed.com/view/Entry/99565#eid64994 [3] Sarcasm Corpus – Review of Amazon Products, http://storm.cis.fordham.edu/~filatova/SarcasmCorpus.html [4] Filatova, Elena, Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing, https://pdfs.semanticscholar.org/830f/d0969fbc1e5f2e112aa9 a948b50216150c0a.pdf [5] Sarcasm Corpus – V2, https://nlds.soe.ucsc.edu/sarcasm2 [6] Weka 3: Data Mining in Java, http://www.cs.waikato.ac.nz/ml/weka/ [7] http://www.wikihow.com/Detect-Sarcasm-in-Writing and http://www.wikihow.com/Tell-if-Someone-Is-Being- Sarcastic [8] Zezima, Katie, “The Fix: The Secret Service wants software that detects social media sarcasm. Yeah, sure it will work.”, The Washington Post, June 3, 2014 [9] Kruger, Justin; Epley, Nicholas; Parker, Jason; Ng, Zhi-Wen, Egocentrism over e-mail: Can we communicate as well as we think?, Journal of Personality and Social Psychology, Vol 89(6), Dec 2005, 925-936 http://dx.doi.org/10.1037/0022- 3514.89.6.925 [10] Galinsky, A. D., Magee, J. C., Inesi, M. E., & Gruenfeld, D. H. (2006). Power and perspectives not taken. Psychological Science, 17(12), 1068-1074. doi: 10.1111/j.1467- 9280.2006.01824.x (http://pss.sagepub.com/content/17/12/1068.full?oauth- code=M6R720OaLlROh7BdyvSikpQgSwrU8ZZi) [11] Kruger, et.al. [12] Riordan, M. A., & Trichtinger, L. A. (2016). Overconfidence at the Keyboard: Confidence and accuracy in interpreting affect in e‐mail exchanges. Human Communication Research. doi: 10.1111/hcre.12093 [13] Thompson, D., & Filik, R. (2016). Sarcasm in written communication: Emoticons are efficient markers of intention. Journal of Computer‐Mediated Communication, 21(2), 105-120. doi: 10.1111/jcc4.12156