SlideShare a Scribd company logo
1 of 16
Download to read offline
Named Entity Recognition in Twitter:
A Dataset and Analysis on Short-Term Temporal Shifts
Computer Science & Informatics, Cardiff University, Cardiff NLP
Snap Inc.,
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
https://huggingface.co/datasets/tner/tweetner7
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Named-Entity Recognition (NER)
2
Jacob Collier is an English artist.
BERT + Linear Projection
Jacob is English
an artist.
PJacob PCollier Pan PEnglish Partist
Pis
Collier
Tokenization
Location
Person
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
NER on Social Media
● Noisy Text
○ Less formal.
○ Different vocabulary.
○ Emoji, urls, etc.
● Temporal Shift
○ The meaning of words is constantly changing or evolving over time.
○ New entities in new period.
3
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
TweetNER7
TweetNER7 is a NER dataset on Twitter with 7 entity types.
● Person, Location, Corporation, Creative work, Group, Product.
● Tweets are collected from Sep 2019 to Aug 2021.
○ 2020-set: Sep 2019 ~ Aug 2020 (5,768 tweets)
○ 2021-set: Sep 2020 ~ Aug 2021 (5,612 tweets)
● Temporal shift setup
○ Training/Validation: 2020-set
○ Test: 2021-set
4
https://huggingface.co/datasets/tner/tweetner7
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Other Twitter NER Datasets
TweetNER7 is unique for
● Uniform distribution over months
○ 2,000 tweets in each month
● (Relatively) short term temporal shift
○ Over 2 years
● Large annotation (data size & entities)
○ More than 10k annotated tweets with 7 entities
5
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Dataset Collection
TweetTopic tweet collection:
● Query 50 tweets every two hours
from September 2019 to October
2021
● Pre-filtering & Near de-duplication
● Trend-filter
6
Tweet collection pipeline.
Trend-filter
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Preprocessing Tweets
● Special token for URLs.
● Special representation for
usernames.
7
Raw tweet.
Processed tweet.
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Dataset Annotation
● Mechanical Turk:
○ 3 annotators per tweet
● Agreement
○ 1 / 3: Disregard
○ 2 / 3: Manual check
8
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Experiment: Temporal Shift Setup
Setup: Train & validate on 2020-set & test
on 2021-set
Metric: Micro F1 / Macro F1
Result:
● RoBERTa is the best in 2021.
● BERTweet is the best in 2020.
● Performance: 2021 > 2020.
9
Model
Micro F1
(2021/2020)
Macro F1
(2021/2020)
BERTweet Base 64.1 / 66.4 59.4 / 62.4
BERTweet Large 64.0 / 65.9 59.5 / 62.6
RoBERTa Base 64.2 / 64.2 59.1 / 60.2
RoBERTa Large 64.8 / 65.7 60.0 / 61.9
TimeLM2019 64.3 / 65.4 59.3 / 61.1
TimeLM2020 62.9 / 64.4 58.3 / 60.3
TimeLM2021 64.2 / 65.4 59.5 / 61.1
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Results Breakdown
Easy entities:
Person, product, location
● Large number of entities
● Low diversity
Challenging entities:
Creative work, event
● Small number of entities
● High diversity
10
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Experiment: Continuous Fine-tuning
Setup: Use both of 2020 & 2021 training
set by concatenation or continuous
fine-tuning.
Result:
● Continuous fine-tuning achieves
the best results.
● TimeLM19 is better than TimeLM20
and TimeLM21.
11
Model Dataset
Micro F1
(2021/2020)
Macro F1
(2021/2020)
RoBERTa
LARGE
2020-set 64.8 / 65.7 60.0 / 61.9
Continuous 66.0 / 66.3 60.9 / 62.4
TimeLM
2019
2020-set 64.3 / 65.4 59.3 / 61.1
Continuous 65.9 / 64.8 61.1 / 60.6
TimeLM
2020
2020-set 62.9 / 64.4 58.3 / 60.3
Continuous 65.5 / 65.3 60.6 / 61.3
TimeLM
2021
2020-set 64.2 / 65.4 59.5 / 61.1
Continuous 65.1 / 64.9 60.0 / 60.7
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Experiment: Self-labeling
Setup: Collect additional 93,594 and
878,80 tweets from the period of
2020-set and 2021-set. Generate pseudo
annotation by fine-tuned RoBERTa Large
model.
Result: No significant improvements.
12
Dataset
Micro F1
(2021)
Macro F1
(2021)
2020-set 64.8 60
2020-self-labeling 64.6 59.3
2021-self-labeling 64.2 59.3
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Are the pseudo labels not useful at all?
Setup: For incorrect prediction,
retrieve the pseudo-labels for the
entity within the range of 7 days,
and take the most frequent
prediction as the contextualized
prediction.
13
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Contextualized prediction usually
matches the original one, meaning is
wrong 😩
But, the second most frequent
prediction is correct in average 🤔
There is a signal at least, but not easy
way to utilize it.
14
Result of Contextualized Prediction
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Summary
● TweetNER7, a NER dataset on Twitter with temporal-shift and 7 entity types
● Report LM fine-tuning results
○ Temporal split vs Random split
○ Continuous fine-tuning
● Self-labeling
○ Contextual prediction
15
Thank you!!

More Related Content

Similar to 2022-11, AACL, Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysisijtsrd
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyNeo4j
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyGreta Workman
 
Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011Lee Yount
 
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia UniversityTunghai University
 
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...Alexandre Sieira
 
Neurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons LearnedNeurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons LearnedStanford University
 
Bitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and ForecastingBitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and ForecastingIRJET Journal
 
The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream Inside Analysis
 
Privacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSMPrivacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSMIIIT Hyderabad
 
Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23Chris Lundberg
 
Predicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social MediaPredicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social MediaJohn Cai
 
How Data Visualization Enhances the News
How Data Visualization Enhances the NewsHow Data Visualization Enhances the News
How Data Visualization Enhances the NewsInside Analysis
 
Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Stanford University
 
No estimates - 10 new principles for testing
No estimates  - 10 new principles for testingNo estimates  - 10 new principles for testing
No estimates - 10 new principles for testingVasco Duarte
 
Semantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports TweetsSemantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports Tweetsmitsmit
 

Similar to 2022-11, AACL, Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts (20)

Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
 
Blenderbot
BlenderbotBlenderbot
Blenderbot
 
Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011
 
bdSquared
bdSquaredbdSquared
bdSquared
 
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia University
 
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
 
Neurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons LearnedNeurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons Learned
 
Bitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and ForecastingBitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and Forecasting
 
Webinar: The Future of Data-driven Storytelling
Webinar: The Future of Data-driven StorytellingWebinar: The Future of Data-driven Storytelling
Webinar: The Future of Data-driven Storytelling
 
The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream
 
Blockchain v Cryptocurrency: Talk for BridgeSF
Blockchain v Cryptocurrency: Talk for BridgeSF Blockchain v Cryptocurrency: Talk for BridgeSF
Blockchain v Cryptocurrency: Talk for BridgeSF
 
Privacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSMPrivacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSM
 
Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23
 
Predicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social MediaPredicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social Media
 
How Data Visualization Enhances the News
How Data Visualization Enhances the NewsHow Data Visualization Enhances the News
How Data Visualization Enhances the News
 
Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016
 
No estimates - 10 new principles for testing
No estimates  - 10 new principles for testingNo estimates  - 10 new principles for testing
No estimates - 10 new principles for testing
 
Semantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports TweetsSemantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports Tweets
 

More from asahiushio1

2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...asahiushio1
 
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...asahiushio1
 
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...asahiushio1
 
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...asahiushio1
 
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probingasahiushio1
 
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...asahiushio1
 
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...asahiushio1
 

More from asahiushio1 (7)

2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
 
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
 
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
 
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
 
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
 
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
 
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
 

Recently uploaded

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 

Recently uploaded (20)

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 

2022-11, AACL, Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

  • 1. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Computer Science & Informatics, Cardiff University, Cardiff NLP Snap Inc., Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados https://huggingface.co/datasets/tner/tweetner7
  • 2. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Named-Entity Recognition (NER) 2 Jacob Collier is an English artist. BERT + Linear Projection Jacob is English an artist. PJacob PCollier Pan PEnglish Partist Pis Collier Tokenization Location Person
  • 3. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados NER on Social Media ● Noisy Text ○ Less formal. ○ Different vocabulary. ○ Emoji, urls, etc. ● Temporal Shift ○ The meaning of words is constantly changing or evolving over time. ○ New entities in new period. 3
  • 4. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados TweetNER7 TweetNER7 is a NER dataset on Twitter with 7 entity types. ● Person, Location, Corporation, Creative work, Group, Product. ● Tweets are collected from Sep 2019 to Aug 2021. ○ 2020-set: Sep 2019 ~ Aug 2020 (5,768 tweets) ○ 2021-set: Sep 2020 ~ Aug 2021 (5,612 tweets) ● Temporal shift setup ○ Training/Validation: 2020-set ○ Test: 2021-set 4 https://huggingface.co/datasets/tner/tweetner7
  • 5. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Other Twitter NER Datasets TweetNER7 is unique for ● Uniform distribution over months ○ 2,000 tweets in each month ● (Relatively) short term temporal shift ○ Over 2 years ● Large annotation (data size & entities) ○ More than 10k annotated tweets with 7 entities 5
  • 6. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Dataset Collection TweetTopic tweet collection: ● Query 50 tweets every two hours from September 2019 to October 2021 ● Pre-filtering & Near de-duplication ● Trend-filter 6 Tweet collection pipeline. Trend-filter
  • 7. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Preprocessing Tweets ● Special token for URLs. ● Special representation for usernames. 7 Raw tweet. Processed tweet.
  • 8. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Dataset Annotation ● Mechanical Turk: ○ 3 annotators per tweet ● Agreement ○ 1 / 3: Disregard ○ 2 / 3: Manual check 8
  • 9. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Experiment: Temporal Shift Setup Setup: Train & validate on 2020-set & test on 2021-set Metric: Micro F1 / Macro F1 Result: ● RoBERTa is the best in 2021. ● BERTweet is the best in 2020. ● Performance: 2021 > 2020. 9 Model Micro F1 (2021/2020) Macro F1 (2021/2020) BERTweet Base 64.1 / 66.4 59.4 / 62.4 BERTweet Large 64.0 / 65.9 59.5 / 62.6 RoBERTa Base 64.2 / 64.2 59.1 / 60.2 RoBERTa Large 64.8 / 65.7 60.0 / 61.9 TimeLM2019 64.3 / 65.4 59.3 / 61.1 TimeLM2020 62.9 / 64.4 58.3 / 60.3 TimeLM2021 64.2 / 65.4 59.5 / 61.1
  • 10. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Results Breakdown Easy entities: Person, product, location ● Large number of entities ● Low diversity Challenging entities: Creative work, event ● Small number of entities ● High diversity 10
  • 11. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Experiment: Continuous Fine-tuning Setup: Use both of 2020 & 2021 training set by concatenation or continuous fine-tuning. Result: ● Continuous fine-tuning achieves the best results. ● TimeLM19 is better than TimeLM20 and TimeLM21. 11 Model Dataset Micro F1 (2021/2020) Macro F1 (2021/2020) RoBERTa LARGE 2020-set 64.8 / 65.7 60.0 / 61.9 Continuous 66.0 / 66.3 60.9 / 62.4 TimeLM 2019 2020-set 64.3 / 65.4 59.3 / 61.1 Continuous 65.9 / 64.8 61.1 / 60.6 TimeLM 2020 2020-set 62.9 / 64.4 58.3 / 60.3 Continuous 65.5 / 65.3 60.6 / 61.3 TimeLM 2021 2020-set 64.2 / 65.4 59.5 / 61.1 Continuous 65.1 / 64.9 60.0 / 60.7
  • 12. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Experiment: Self-labeling Setup: Collect additional 93,594 and 878,80 tweets from the period of 2020-set and 2021-set. Generate pseudo annotation by fine-tuned RoBERTa Large model. Result: No significant improvements. 12 Dataset Micro F1 (2021) Macro F1 (2021) 2020-set 64.8 60 2020-self-labeling 64.6 59.3 2021-self-labeling 64.2 59.3
  • 13. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Are the pseudo labels not useful at all? Setup: For incorrect prediction, retrieve the pseudo-labels for the entity within the range of 7 days, and take the most frequent prediction as the contextualized prediction. 13
  • 14. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Contextualized prediction usually matches the original one, meaning is wrong 😩 But, the second most frequent prediction is correct in average 🤔 There is a signal at least, but not easy way to utilize it. 14 Result of Contextualized Prediction
  • 15. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Summary ● TweetNER7, a NER dataset on Twitter with temporal-shift and 7 entity types ● Report LM fine-tuning results ○ Temporal split vs Random split ○ Continuous fine-tuning ● Self-labeling ○ Contextual prediction 15