SlideShare a Scribd company logo
Named Entity Recognition in Twitter:
A Dataset and Analysis on Short-Term Temporal Shifts
Computer Science & Informatics, Cardiff University, Cardiff NLP
Snap Inc.,
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
https://huggingface.co/datasets/tner/tweetner7
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Named-Entity Recognition (NER)
2
Jacob Collier is an English artist.
BERT + Linear Projection
Jacob is English
an artist.
PJacob PCollier Pan PEnglish Partist
Pis
Collier
Tokenization
Location
Person
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
NER on Social Media
● Noisy Text
○ Less formal.
○ Different vocabulary.
○ Emoji, urls, etc.
● Temporal Shift
○ The meaning of words is constantly changing or evolving over time.
○ New entities in new period.
3
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
TweetNER7
TweetNER7 is a NER dataset on Twitter with 7 entity types.
● Person, Location, Corporation, Creative work, Group, Product.
● Tweets are collected from Sep 2019 to Aug 2021.
○ 2020-set: Sep 2019 ~ Aug 2020 (5,768 tweets)
○ 2021-set: Sep 2020 ~ Aug 2021 (5,612 tweets)
● Temporal shift setup
○ Training/Validation: 2020-set
○ Test: 2021-set
4
https://huggingface.co/datasets/tner/tweetner7
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Other Twitter NER Datasets
TweetNER7 is unique for
● Uniform distribution over months
○ 2,000 tweets in each month
● (Relatively) short term temporal shift
○ Over 2 years
● Large annotation (data size & entities)
○ More than 10k annotated tweets with 7 entities
5
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Dataset Collection
TweetTopic tweet collection:
● Query 50 tweets every two hours
from September 2019 to October
2021
● Pre-filtering & Near de-duplication
● Trend-filter
6
Tweet collection pipeline.
Trend-filter
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Preprocessing Tweets
● Special token for URLs.
● Special representation for
usernames.
7
Raw tweet.
Processed tweet.
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Dataset Annotation
● Mechanical Turk:
○ 3 annotators per tweet
● Agreement
○ 1 / 3: Disregard
○ 2 / 3: Manual check
8
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Experiment: Temporal Shift Setup
Setup: Train & validate on 2020-set & test
on 2021-set
Metric: Micro F1 / Macro F1
Result:
● RoBERTa is the best in 2021.
● BERTweet is the best in 2020.
● Performance: 2021 > 2020.
9
Model
Micro F1
(2021/2020)
Macro F1
(2021/2020)
BERTweet Base 64.1 / 66.4 59.4 / 62.4
BERTweet Large 64.0 / 65.9 59.5 / 62.6
RoBERTa Base 64.2 / 64.2 59.1 / 60.2
RoBERTa Large 64.8 / 65.7 60.0 / 61.9
TimeLM2019 64.3 / 65.4 59.3 / 61.1
TimeLM2020 62.9 / 64.4 58.3 / 60.3
TimeLM2021 64.2 / 65.4 59.5 / 61.1
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Results Breakdown
Easy entities:
Person, product, location
● Large number of entities
● Low diversity
Challenging entities:
Creative work, event
● Small number of entities
● High diversity
10
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Experiment: Continuous Fine-tuning
Setup: Use both of 2020 & 2021 training
set by concatenation or continuous
fine-tuning.
Result:
● Continuous fine-tuning achieves
the best results.
● TimeLM19 is better than TimeLM20
and TimeLM21.
11
Model Dataset
Micro F1
(2021/2020)
Macro F1
(2021/2020)
RoBERTa
LARGE
2020-set 64.8 / 65.7 60.0 / 61.9
Continuous 66.0 / 66.3 60.9 / 62.4
TimeLM
2019
2020-set 64.3 / 65.4 59.3 / 61.1
Continuous 65.9 / 64.8 61.1 / 60.6
TimeLM
2020
2020-set 62.9 / 64.4 58.3 / 60.3
Continuous 65.5 / 65.3 60.6 / 61.3
TimeLM
2021
2020-set 64.2 / 65.4 59.5 / 61.1
Continuous 65.1 / 64.9 60.0 / 60.7
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Experiment: Self-labeling
Setup: Collect additional 93,594 and
878,80 tweets from the period of
2020-set and 2021-set. Generate pseudo
annotation by fine-tuned RoBERTa Large
model.
Result: No significant improvements.
12
Dataset
Micro F1
(2021)
Macro F1
(2021)
2020-set 64.8 60
2020-self-labeling 64.6 59.3
2021-self-labeling 64.2 59.3
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Are the pseudo labels not useful at all?
Setup: For incorrect prediction,
retrieve the pseudo-labels for the
entity within the range of 7 days,
and take the most frequent
prediction as the contextualized
prediction.
13
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Contextualized prediction usually
matches the original one, meaning is
wrong 😩
But, the second most frequent
prediction is correct in average 🤔
There is a signal at least, but not easy
way to utilize it.
14
Result of Contextualized Prediction
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts
Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados
Summary
● TweetNER7, a NER dataset on Twitter with temporal-shift and 7 entity types
● Report LM fine-tuning results
○ Temporal split vs Random split
○ Continuous fine-tuning
● Self-labeling
○ Contextual prediction
15
Thank you!!

More Related Content

Similar to 2022-11, AACL, Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
ijtsrd
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
Neo4j
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
Greta Workman
 
Blenderbot
BlenderbotBlenderbot
Blenderbot
taeseon ryu
 
Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011
Lee Yount
 
bdSquared
bdSquaredbdSquared
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia University
Tunghai University
 
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Alexandre Sieira
 
Neurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons LearnedNeurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons Learned
Stanford University
 
Bitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and ForecastingBitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and Forecasting
IRJET Journal
 
Webinar: The Future of Data-driven Storytelling
Webinar: The Future of Data-driven StorytellingWebinar: The Future of Data-driven Storytelling
Webinar: The Future of Data-driven Storytelling
LEWIS Global Communications
 
The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream
Inside Analysis
 
Blockchain v Cryptocurrency: Talk for BridgeSF
Blockchain v Cryptocurrency: Talk for BridgeSF Blockchain v Cryptocurrency: Talk for BridgeSF
Blockchain v Cryptocurrency: Talk for BridgeSF
Kaliya "Identity Woman" Young
 
Privacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSMPrivacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSM
IIIT Hyderabad
 
Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23
Chris Lundberg
 
Predicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social MediaPredicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social Media
John Cai
 
How Data Visualization Enhances the News
How Data Visualization Enhances the NewsHow Data Visualization Enhances the News
How Data Visualization Enhances the News
Inside Analysis
 
Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016
Stanford University
 
No estimates - 10 new principles for testing
No estimates  - 10 new principles for testingNo estimates  - 10 new principles for testing
No estimates - 10 new principles for testing
Vasco Duarte
 
Semantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports TweetsSemantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports Tweets
mitsmit
 

Similar to 2022-11, AACL, Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts (20)

Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
 
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph TechnologyThe Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
The Five Graphs of Government: How Federal Agencies can Utilize Graph Technology
 
Blenderbot
BlenderbotBlenderbot
Blenderbot
 
Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011Social Media - Shock and Awe - NC3C Conference 3-30-2011
Social Media - Shock and Awe - NC3C Conference 3-30-2011
 
bdSquared
bdSquaredbdSquared
bdSquared
 
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia University
 
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
Threat Intelligence Baseada em Dados: Métricas de Disseminação e Compartilham...
 
Neurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons LearnedNeurodb Engr245 2021 Lessons Learned
Neurodb Engr245 2021 Lessons Learned
 
Bitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and ForecastingBitcoin Price Prediction and Forecasting
Bitcoin Price Prediction and Forecasting
 
Webinar: The Future of Data-driven Storytelling
Webinar: The Future of Data-driven StorytellingWebinar: The Future of Data-driven Storytelling
Webinar: The Future of Data-driven Storytelling
 
The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream The New Simple: Predictive Analytics for the Mainstream
The New Simple: Predictive Analytics for the Mainstream
 
Blockchain v Cryptocurrency: Talk for BridgeSF
Blockchain v Cryptocurrency: Talk for BridgeSF Blockchain v Cryptocurrency: Talk for BridgeSF
Blockchain v Cryptocurrency: Talk for BridgeSF
 
Privacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSMPrivacy and Security in Online Social Media : Trust and Credebillity on OSM
Privacy and Security in Online Social Media : Trust and Credebillity on OSM
 
Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23Frakture 4 fold webinar presentation 2015-04_23
Frakture 4 fold webinar presentation 2015-04_23
 
Predicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social MediaPredicting Business Confidence using News and Social Media
Predicting Business Confidence using News and Social Media
 
How Data Visualization Enhances the News
How Data Visualization Enhances the NewsHow Data Visualization Enhances the News
How Data Visualization Enhances the News
 
Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016Narrative Mind Week 5 H4D Stanford 2016
Narrative Mind Week 5 H4D Stanford 2016
 
No estimates - 10 new principles for testing
No estimates  - 10 new principles for testingNo estimates  - 10 new principles for testing
No estimates - 10 new principles for testing
 
Semantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports TweetsSemantic Entity extraction from Sports Tweets
Semantic Entity extraction from Sports Tweets
 

More from asahiushio1

2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
asahiushio1
 
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
asahiushio1
 
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
asahiushio1
 
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
asahiushio1
 
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
asahiushio1
 
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
asahiushio1
 
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
asahiushio1
 

More from asahiushio1 (7)

2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
2022-10, UCL NLP meetup, Toward a Better Understanding of Relational Knowledg...
 
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
2017-12, Keio University, Projection-based Regularized Dual Averaging for Sto...
 
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
2017-07, Research Seminar at Keio University, Metric Perspective of Stochasti...
 
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
2017-03, ICASSP, Projection-based Dual Averaging for Stochastic Sparse Optimi...
 
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
2020-12, Cardiff NLP Reading Group, Commonsense Knowledge Probing
 
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
2021-05, ACL, BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language ...
 
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
2021-04, EACL, T-NER: An All-Round Python Library for Transformer-based Named...
 

Recently uploaded

快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Selcen Ozturkcan
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
Clinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdfClinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdf
RAYMUNDONAVARROCORON
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
gastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptxgastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptx
Shekar Boddu
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
Scintica Instrumentation
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
PirithiRaju
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
sandertein
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
Frédéric Baudron
 
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdfHUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
Ritik83251
 
fermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptxfermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptx
ananya23nair
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
frank0071
 
BIRDS DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptx
BIRDS  DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptxBIRDS  DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptx
BIRDS DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptx
goluk9330
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
PirithiRaju
 
Microbiology of Central Nervous System INFECTIONS.pdf
Microbiology of Central Nervous System INFECTIONS.pdfMicrobiology of Central Nervous System INFECTIONS.pdf
Microbiology of Central Nervous System INFECTIONS.pdf
sammy700571
 
Physiology of Nervous System presentation.pptx
Physiology of Nervous System presentation.pptxPhysiology of Nervous System presentation.pptx
Physiology of Nervous System presentation.pptx
fatima132662
 
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE  AND ITS BENIFITS.pptxIMPORTANCE OF ALGAE  AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
OmAle5
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
Advanced-Concepts-Team
 

Recently uploaded (20)

快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
Clinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdfClinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdf
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
gastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptxgastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptx
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
 
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdfHUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
 
fermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptxfermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptx
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
 
BIRDS DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptx
BIRDS  DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptxBIRDS  DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptx
BIRDS DIVERSITY OF SOOTEA BISWANATH ASSAM.ppt.pptx
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
 
Microbiology of Central Nervous System INFECTIONS.pdf
Microbiology of Central Nervous System INFECTIONS.pdfMicrobiology of Central Nervous System INFECTIONS.pdf
Microbiology of Central Nervous System INFECTIONS.pdf
 
Physiology of Nervous System presentation.pptx
Physiology of Nervous System presentation.pptxPhysiology of Nervous System presentation.pptx
Physiology of Nervous System presentation.pptx
 
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE  AND ITS BENIFITS.pptxIMPORTANCE OF ALGAE  AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
 

2022-11, AACL, Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

  • 1. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Computer Science & Informatics, Cardiff University, Cardiff NLP Snap Inc., Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados https://huggingface.co/datasets/tner/tweetner7
  • 2. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Named-Entity Recognition (NER) 2 Jacob Collier is an English artist. BERT + Linear Projection Jacob is English an artist. PJacob PCollier Pan PEnglish Partist Pis Collier Tokenization Location Person
  • 3. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados NER on Social Media ● Noisy Text ○ Less formal. ○ Different vocabulary. ○ Emoji, urls, etc. ● Temporal Shift ○ The meaning of words is constantly changing or evolving over time. ○ New entities in new period. 3
  • 4. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados TweetNER7 TweetNER7 is a NER dataset on Twitter with 7 entity types. ● Person, Location, Corporation, Creative work, Group, Product. ● Tweets are collected from Sep 2019 to Aug 2021. ○ 2020-set: Sep 2019 ~ Aug 2020 (5,768 tweets) ○ 2021-set: Sep 2020 ~ Aug 2021 (5,612 tweets) ● Temporal shift setup ○ Training/Validation: 2020-set ○ Test: 2021-set 4 https://huggingface.co/datasets/tner/tweetner7
  • 5. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Other Twitter NER Datasets TweetNER7 is unique for ● Uniform distribution over months ○ 2,000 tweets in each month ● (Relatively) short term temporal shift ○ Over 2 years ● Large annotation (data size & entities) ○ More than 10k annotated tweets with 7 entities 5
  • 6. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Dataset Collection TweetTopic tweet collection: ● Query 50 tweets every two hours from September 2019 to October 2021 ● Pre-filtering & Near de-duplication ● Trend-filter 6 Tweet collection pipeline. Trend-filter
  • 7. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Preprocessing Tweets ● Special token for URLs. ● Special representation for usernames. 7 Raw tweet. Processed tweet.
  • 8. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Dataset Annotation ● Mechanical Turk: ○ 3 annotators per tweet ● Agreement ○ 1 / 3: Disregard ○ 2 / 3: Manual check 8
  • 9. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Experiment: Temporal Shift Setup Setup: Train & validate on 2020-set & test on 2021-set Metric: Micro F1 / Macro F1 Result: ● RoBERTa is the best in 2021. ● BERTweet is the best in 2020. ● Performance: 2021 > 2020. 9 Model Micro F1 (2021/2020) Macro F1 (2021/2020) BERTweet Base 64.1 / 66.4 59.4 / 62.4 BERTweet Large 64.0 / 65.9 59.5 / 62.6 RoBERTa Base 64.2 / 64.2 59.1 / 60.2 RoBERTa Large 64.8 / 65.7 60.0 / 61.9 TimeLM2019 64.3 / 65.4 59.3 / 61.1 TimeLM2020 62.9 / 64.4 58.3 / 60.3 TimeLM2021 64.2 / 65.4 59.5 / 61.1
  • 10. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Results Breakdown Easy entities: Person, product, location ● Large number of entities ● Low diversity Challenging entities: Creative work, event ● Small number of entities ● High diversity 10
  • 11. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Experiment: Continuous Fine-tuning Setup: Use both of 2020 & 2021 training set by concatenation or continuous fine-tuning. Result: ● Continuous fine-tuning achieves the best results. ● TimeLM19 is better than TimeLM20 and TimeLM21. 11 Model Dataset Micro F1 (2021/2020) Macro F1 (2021/2020) RoBERTa LARGE 2020-set 64.8 / 65.7 60.0 / 61.9 Continuous 66.0 / 66.3 60.9 / 62.4 TimeLM 2019 2020-set 64.3 / 65.4 59.3 / 61.1 Continuous 65.9 / 64.8 61.1 / 60.6 TimeLM 2020 2020-set 62.9 / 64.4 58.3 / 60.3 Continuous 65.5 / 65.3 60.6 / 61.3 TimeLM 2021 2020-set 64.2 / 65.4 59.5 / 61.1 Continuous 65.1 / 64.9 60.0 / 60.7
  • 12. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Experiment: Self-labeling Setup: Collect additional 93,594 and 878,80 tweets from the period of 2020-set and 2021-set. Generate pseudo annotation by fine-tuned RoBERTa Large model. Result: No significant improvements. 12 Dataset Micro F1 (2021) Macro F1 (2021) 2020-set 64.8 60 2020-self-labeling 64.6 59.3 2021-self-labeling 64.2 59.3
  • 13. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Are the pseudo labels not useful at all? Setup: For incorrect prediction, retrieve the pseudo-labels for the entity within the range of 7 days, and take the most frequent prediction as the contextualized prediction. 13
  • 14. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Contextualized prediction usually matches the original one, meaning is wrong 😩 But, the second most frequent prediction is correct in average 🤔 There is a signal at least, but not easy way to utilize it. 14 Result of Contextualized Prediction
  • 15. Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados Summary ● TweetNER7, a NER dataset on Twitter with temporal-shift and 7 entity types ● Report LM fine-tuning results ○ Temporal split vs Random split ○ Continuous fine-tuning ● Self-labeling ○ Contextual prediction 15