Who's to say what's funny? A computer using Language Models and Deep Learning, That's Who!


Poster presented at WiNLP workshop at ACL 2017 in Vancouver. Describes approaches to recognizing humorous tweets based on language models and LSTM.

Who’s to say what’s funny?
A computer using Language Models and Deep Learning,
That’s Who!
Xinru Yan & Ted Pedersen
{yanxx418,tpederse}@d.umn.edu
Department of Computer Science University of Minnesota Duluth
The Problem
• Traditional humor detection: binary classification
• Our focus: learn a continuous and subjective sense of humor from tweets submitted to the @midnight show in response to hashtags, using Ngram Language Models (LMs) and Deep Learning (DL) methods
• We participated in SemEval-2017 Task 6 #HashtagWars: Learning a Sense of Humor [1]
  • Tweets fall into three baskets: the single funniest tweet, the next nine funniest tweets, and all remaining tweets
• Two Subtasks
  • A: Pairwise Comparison – given a hashtag file, a system should choose the funnier of two tweets
  • B: Semi-Ranking – given a hashtag file, a system should sort tweets into the right baskets
• Dataset
  • Tweet Data: provided by the task, 106 hashtag files, about 21,580 tokens
  • News Data: 6.2 GB of English news, about 2 million tokens [2]
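
Both subtasks can be driven by a single per-tweet score. The sketch below is a minimal illustration and not the task's official tooling: the function names `funnier_of` and `assign_baskets` and the example scores are hypothetical, and it assumes that a higher score means a funnier tweet. The basket sizes (one, nine, the rest) follow the task description above.

```python
from typing import Dict


def funnier_of(tweet_a: str, tweet_b: str, scores: Dict[str, float]) -> str:
    """Subtask A: return whichever of two tweets has the higher score."""
    return tweet_a if scores[tweet_a] >= scores[tweet_b] else tweet_b


def assign_baskets(scores: Dict[str, float]) -> Dict[str, str]:
    """Subtask B: top tweet, next nine tweets, then everything else."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    baskets = {}
    for rank, tweet in enumerate(ranked):
        if rank == 0:
            baskets[tweet] = "funniest"
        elif rank < 10:
            baskets[tweet] = "funny"
        else:
            baskets[tweet] = "not funny"
    return baskets


# Hypothetical scores for twelve placeholder tweets from one hashtag file.
example_scores = {f"tweet {i}": float(i) for i in range(12)}
example_baskets = assign_baskets(example_scores)
```

Any scoring function can be plugged in here, which is why the same language model can serve both subtasks.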
Examples from #BreakUpIn5Words (Trigram LM, News Data)
Tweet                          @midnight   LM          DL
It’s not you, it’s meth.       funniest    funny       ?
Hey, can we NOT talk?          funny       funny
You need your own Netflix      funny       not funny
Figured I’d try being happy.   not funny   funny
You’re a Mac, I’m PC           not funny   not funny
Language Models
Ngram LMs learn humor from training data and allow ranking by assigning a probability to each statement [3][4]
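
As a rough illustration of how an Ngram LM turns a statement into a score, the sketch below trains a bigram model on a toy corpus and ranks candidate sentences by log-probability. It is a simplification under stated assumptions: it uses add-one smoothing rather than the modified Kneser-Ney estimation of [3], the class name `BigramLM` and the toy sentences are hypothetical, and whether a higher or a lower probability should count as funnier is left open here.

```python
import math
from collections import Counter
from typing import List


class BigramLM:
    """Toy bigram language model with add-one (Laplace) smoothing."""

    def __init__(self, sentences: List[str]) -> None:
        self.unigrams: Counter = Counter()
        self.bigrams: Counter = Counter()
        for sentence in sentences:
            tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
            self.unigrams.update(tokens)
            self.bigrams.update(zip(tokens, tokens[1:]))
        self.vocab_size = len(self.unigrams)

    def log_prob(self, sentence: str) -> float:
        """Sum of smoothed log P(word | previous word) over the sentence."""
        tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
        total = 0.0
        for prev, word in zip(tokens, tokens[1:]):
            numerator = self.bigrams[(prev, word)] + 1
            denominator = self.unigrams[prev] + self.vocab_size
            total += math.log(numerator / denominator)
        return total


# Hypothetical training text standing in for a real corpus.
lm = BigramLM([
    "we need to talk",
    "can we talk",
    "you need your own space",
])

candidates = ["can we talk", "talk we can"]
ranked = sorted(candidates, key=lm.log_prob, reverse=True)
```

The word order seen in training receives the higher log-probability, so `ranked` lists "can we talk" first.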
Language Model Results
We seek high accuracy for A and low distance for B.
Dataset   Ngram   Accuracy (A)   Distance (B)
news      3       0.627 (4th)    0.872 (1st)
news      2       0.624          0.853
tweet     3       0.397 (8th)    0.967 (8th)
tweet     2       0.406          0.944
• The type and quantity of the corpora are what really matter → more tweet data, less news data
• Bigram LMs performed slightly better than trigram LMs → unigram and character-level LMs
Deep Learning
• Humor relies on creative use of language, which produces too many out-of-vocabulary (OOV) words
• Jokes often include puns based on invented words
    Barktender #DogJobs
    Tinderella #UpdateAFairyTale
• Token-level LMs cannot understand such puns
• Character-based CNNs (CharCNN) do not depend on having observed tokens in the training data
• Bigram and trigram LMs only use the one or two preceding words to predict the next word → LSTMs are good at making use of sequential data such as text and are designed for long-term dependencies
• Some hashtags require tweets to have more than three words, and some funny tweets are mostly made up of common bigrams or trigrams
    Complaining makes it better #AmericaIn4Words
    Romantic dinners with the cats #BestWeekendIn5Words
• Ngram LMs do not include external knowledge such as movie titles and song lyrics → create word embeddings from domain-specific materials
• Our plan: use the Keras library to train CharCNN + LSTM LMs on both datasets and investigate ways to include domain-knowledge word embeddings in the CharCNN + LSTM LM
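
The OOV problem described above can be made concrete with a small sketch. It only illustrates why character-level features help and is not the CharCNN + LSTM model planned here: the toy vocabulary, the helpers `char_ngrams` and `char_overlap`, and the use of Jaccard overlap are all assumptions introduced for the example.

```python
from typing import Set


def char_ngrams(word: str, n: int = 3) -> Set[str]:
    """Return the set of character n-grams of a lowercased word."""
    word = word.lower()
    return {word[i:i + n] for i in range(len(word) - n + 1)}


def char_overlap(a: str, b: str, n: int = 3) -> float:
    """Jaccard overlap between the character n-gram sets of two words."""
    grams_a, grams_b = char_ngrams(a, n), char_ngrams(b, n)
    if not grams_a or not grams_b:
        return 0.0
    return len(grams_a & grams_b) / len(grams_a | grams_b)


# Hypothetical token vocabulary observed during training.
vocabulary = {"bartender", "bark", "cinderella", "tinder"}

for pun in ["Barktender", "Tinderella"]:
    is_oov = pun.lower() not in vocabulary
    closest = max(vocabulary, key=lambda known: char_overlap(pun, known))
    print(pun, "OOV" if is_oov else "known", "closest:", closest)
```

With this toy vocabulary both puns are OOV at the token level, yet each shares most of its character trigrams with a known word ("bartender" and "cinderella" respectively), which is the kind of signal a character-based model can exploit.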
References
[1] Peter Potash, Alexey Romanov, and Anna Rumshisky.
SemEval-2017 Task 6: #HashtagWars: Learning a sense of humor.
In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), Vancouver, BC, August 2017.
[2] EMNLP 2011 Sixth Workshop on Statistical Machine Translation.
http://www.statmt.org/wmt11/translation-task.html.
[3] Kenneth Heafield, Ivan Pouzyrevsky, Jonathan H. Clark, and Philipp Koehn.
Scalable modified Kneser-Ney language model estimation.
In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 2013.
[4] Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg, and Lillian Lee.
You had me at hello: How phrasing affects memorability.
In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Stroudsburg, PA, USA, 2012.
