SlideShare a Scribd company logo
1 of 1
Download to read offline
Active Learning for Filtering
Maria-Hendrike Peetz, Damiano Spina, Maarten de Rijke, Julio Gonzalo
ISLA, Informatics Institute
University of Amsterdam
http://ilps.science.uva.nl
Lessons Learnt
UvA and UNED at RepLab 2013
• Filtering models worked well on the trial data
• Active learning with 1% improves weak classifiers
• There was not enough training data for language
dependent training.
NLP & IR Group
UNED
http://nlp.uned.es
Combine expert knowledge with computational effort
by asking the right questions.
Annotating few examples is ok.
@nokia: My phone stopped
working.
Data changes over time.
@microsoft
2012
2013
1. Feature
representation
2. Model
Training/Update
4. Candidate
Selection
5. User Feedback
Training Dataset
Model
3. Classification
Test Dataset
1. Feature
representation

new labeled
instance
Feature representation Model Training/Update
Candidate SelectionClassification
Sample tweets, annotate them, and update the model.
With a decent baseline, annotating 15 tweets per entity is enough.
1. Analysts need to keep up to
date anyway.
2. They have the ultimate
responsibility for the result.
Results
Margin/uncertainty sampling:
Selecting examples close to the
margin means sampling examples
where the classifier is less confident.
• Naive Bayes
• Tweet metadata (always)
• Entity linking of the tweets (BoE)
• Retrain NB with every new instance.
• Higher weight to newly annotated instances.
Select 1% of tweets for
annotation:
On average that are 15 tweets per
entity, in the language dependent
case: 10 English, 5 Spanish
4.2 Submitted runs
Table 5 shows the results of our o cial runs with respect to accuracy, reliability
(R), sensitivity (S), and F(R,S), the F1 Measure of Reliability and Sensitivity. We
can see that our baselines as well as the best performing system performs worse
than a simple baseline. The provided baseline selects the class of an instance
based on the class of the closest (using Jaccard similarity) instance in the training
set.
We can, however, see some interesting improvements. For a start, active learn-
ing helps. We can see that the use of 1% annotation improves the results for all
four metrics. Secondly, building a language-independent model performs better
than building two language-dependent models per entity. Finally, we can see that
the class imbalance also holds in the test set, as assigning the majority class for
strongly skewed data performs much better than using active learning alone.
Table 4. Results of the o cial runs.
run id accuracy R S F(R,S)
baseline 0.8714 0.4902 0.3200 0.3255
UvA UNED filtering 1 0.2785 0.1635 0.1258 0.0730
UvA UNED filtering 2 0.2847 0.2050 0.1441 0.0928
UvA UNED filtering 3 0.5657 0.2040 0.2369 0.1449
UvA UNED filtering 4 0.6360 0.2386 0.2782 0.1857
UvA UNED filtering 5 0.7745 0.6486 0.1833 0.1737
UvA UNED filtering 6 0.8155 0.6780 0.2187 0.2083
Table 5. Results of the o cial runs.
run id accuracy R S F(R,S)
Jaccard 0.8714 0.4902 0.3200 0.3255
BoE+lang. independent 0.2785 0.1635 0.1258 0.0730
BoE+lang. dependent 0.2847 0.2050 0.1441 0.0928
BoE+lang. independent + AL 0.5657 0.2040 0.2369 0.1449
BoE+lang. dependent + AL 0.6360 0.2386 0.2782 0.1857
BoE+lang. independent + AL + majority 0.7745 0.6486 0.1833 0.1737
BoE+lang. dependent + AL + majority 0.8155 0.6780 0.2187 0.2083
5 Conclusions
We have presented an active learning approach to company name disambigua-
tion in tweets. For this classification task, we found that active learning does
? ? ? ? ?
1 1
0 0
Task: Identify
tweets that are
relevant to a
company.
M.H.Peetz@uva.nl damiano@lsi.uned.es

More Related Content

What's hot

Supervised Machine Learning
Supervised Machine LearningSupervised Machine Learning
Supervised Machine LearningAnkit Rai
 
Slide 1
Slide 1Slide 1
Slide 1butest
 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Marina Santini
 
How to determine the sample population
How to determine the sample populationHow to determine the sample population
How to determine the sample populationschool
 
An Artificial Neural Network Model to Predict Academic Performance
An Artificial Neural Network Model to Predict Academic Performance An Artificial Neural Network Model to Predict Academic Performance
An Artificial Neural Network Model to Predict Academic Performance Abhinav Singh
 
Calmorin's formula
Calmorin's formulaCalmorin's formula
Calmorin's formulaschool
 

What's hot (10)

Supervised Machine Learning
Supervised Machine LearningSupervised Machine Learning
Supervised Machine Learning
 
Slide 1
Slide 1Slide 1
Slide 1
 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)
 
Ensemble learning
Ensemble learningEnsemble learning
Ensemble learning
 
Supervised learning
  Supervised learning  Supervised learning
Supervised learning
 
How to determine the sample population
How to determine the sample populationHow to determine the sample population
How to determine the sample population
 
An Artificial Neural Network Model to Predict Academic Performance
An Artificial Neural Network Model to Predict Academic Performance An Artificial Neural Network Model to Predict Academic Performance
An Artificial Neural Network Model to Predict Academic Performance
 
Calmorin's formula
Calmorin's formulaCalmorin's formula
Calmorin's formula
 
Minimax Poster FINAL
Minimax Poster FINALMinimax Poster FINAL
Minimax Poster FINAL
 
Agile methods as problem-based learning designs: setting and assessment
Agile methods as problem-based learning designs: setting and assessmentAgile methods as problem-based learning designs: setting and assessment
Agile methods as problem-based learning designs: setting and assessment
 

Viewers also liked

Otgoo amgalan
Otgoo amgalanOtgoo amgalan
Otgoo amgalansodko27
 
Gemeenteweekend
GemeenteweekendGemeenteweekend
Gemeenteweekendfransmast
 
Atletismo12
Atletismo12Atletismo12
Atletismo12rubensg
 
Sistemas de recogida de información y evaluación de registros sobre Diabetes ...
Sistemas de recogida de información y evaluación de registros sobre Diabetes ...Sistemas de recogida de información y evaluación de registros sobre Diabetes ...
Sistemas de recogida de información y evaluación de registros sobre Diabetes ...Plan de Calidad para el SNS
 
Desportos radicais
Desportos radicaisDesportos radicais
Desportos radicaisKostinha
 
Brechas digitales e informacionales en un contexto de convergencia digital, c...
Brechas digitales e informacionales en un contexto de convergencia digital, c...Brechas digitales e informacionales en un contexto de convergencia digital, c...
Brechas digitales e informacionales en un contexto de convergencia digital, c...Plan de Calidad para el SNS
 
O que os homens querem das mulheres
O que os homens querem das mulheresO que os homens querem das mulheres
O que os homens querem das mulheresJulia Barany
 
[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi tre
[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi tre[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi tre
[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi treGiaSư NhaTrang
 
Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...
Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...
Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...Plan de Calidad para el SNS
 
Los Portadores Del Anillo 1
Los Portadores Del Anillo 1Los Portadores Del Anillo 1
Los Portadores Del Anillo 1cyclopilas
 
Levantando dados, construindo gráficos...
Levantando dados, construindo gráficos...Levantando dados, construindo gráficos...
Levantando dados, construindo gráficos...anapaulatlb
 

Viewers also liked (20)

Otgoo amgalan
Otgoo amgalanOtgoo amgalan
Otgoo amgalan
 
Portfolio
PortfolioPortfolio
Portfolio
 
Choquejetta
ChoquejettaChoquejetta
Choquejetta
 
Maslow’s hierarchy
Maslow’s hierarchyMaslow’s hierarchy
Maslow’s hierarchy
 
Gemeenteweekend
GemeenteweekendGemeenteweekend
Gemeenteweekend
 
Atletismo12
Atletismo12Atletismo12
Atletismo12
 
Sistemas de recogida de información y evaluación de registros sobre Diabetes ...
Sistemas de recogida de información y evaluación de registros sobre Diabetes ...Sistemas de recogida de información y evaluación de registros sobre Diabetes ...
Sistemas de recogida de información y evaluación de registros sobre Diabetes ...
 
Desportos radicais
Desportos radicaisDesportos radicais
Desportos radicais
 
Brechas digitales e informacionales en un contexto de convergencia digital, c...
Brechas digitales e informacionales en un contexto de convergencia digital, c...Brechas digitales e informacionales en un contexto de convergencia digital, c...
Brechas digitales e informacionales en un contexto de convergencia digital, c...
 
O que os homens querem das mulheres
O que os homens querem das mulheresO que os homens querem das mulheres
O que os homens querem das mulheres
 
Negro como el carbón
Negro como el carbónNegro como el carbón
Negro como el carbón
 
CV- Priyanka (1)
CV- Priyanka (1)CV- Priyanka (1)
CV- Priyanka (1)
 
Influencias decameron2
Influencias decameron2Influencias decameron2
Influencias decameron2
 
[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi tre
[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi tre[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi tre
[Giasunhatrang.edu.vn]de so 1 toan hoc tuoi tre
 
América
AméricaAmérica
América
 
Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...
Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...
Experiencias de mejora de la EPOC en dos Áreas de salud de Ávila: Taller de E...
 
Los Portadores Del Anillo 1
Los Portadores Del Anillo 1Los Portadores Del Anillo 1
Los Portadores Del Anillo 1
 
Influencias decameron2
Influencias decameron2Influencias decameron2
Influencias decameron2
 
Levantando dados, construindo gráficos...
Levantando dados, construindo gráficos...Levantando dados, construindo gráficos...
Levantando dados, construindo gráficos...
 
Quebrando o Silêncio
Quebrando o SilêncioQuebrando o Silêncio
Quebrando o Silêncio
 

Similar to Towards an Active Learning System for Company Name Disambiguation in Microblog Streams

Training language models to follow instructions with human feedback (Instruct...
Training language models to follow instructions with human feedback (Instruct...Training language models to follow instructions with human feedback (Instruct...
Training language models to follow instructions with human feedback (Instruct...Rama Irsheidat
 
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...ESEM 2014
 
Building an Effective Skills Strategy for Spain – Consultation Workshop with ...
Building an Effective Skills Strategy for Spain – Consultation Workshop with ...Building an Effective Skills Strategy for Spain – Consultation Workshop with ...
Building an Effective Skills Strategy for Spain – Consultation Workshop with ...EduSkills OECD
 
Mogol johnston 2010_ncea_dif
Mogol johnston 2010_ncea_difMogol johnston 2010_ncea_dif
Mogol johnston 2010_ncea_difcv.mogol
 
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Marcin Junczys-Dowmunt
 
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...IRJET Journal
 
Machine learning: A Walk Through School Exams
Machine learning: A Walk Through School ExamsMachine learning: A Walk Through School Exams
Machine learning: A Walk Through School ExamsRamsha Ijaz
 
QUARTER-3.-LESSON-2-in-RESEARCH-II.pptx
QUARTER-3.-LESSON-2-in-RESEARCH-II.pptxQUARTER-3.-LESSON-2-in-RESEARCH-II.pptx
QUARTER-3.-LESSON-2-in-RESEARCH-II.pptxRheaannCaparas1
 
Venkat Sastry talking on MoLE Project A Global Technology Initiative
Venkat Sastry talking on MoLE ProjectA Global Technology InitiativeVenkat Sastry talking on MoLE ProjectA Global Technology Initiative
Venkat Sastry talking on MoLE Project A Global Technology InitiativeeAssessment in Practice Symposium
 
Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...
Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...
Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...janningr
 
Introduction to machine learning - Ray Poynter - NewMR webinar 2019
Introduction to machine learning  - Ray Poynter - NewMR webinar 2019Introduction to machine learning  - Ray Poynter - NewMR webinar 2019
Introduction to machine learning - Ray Poynter - NewMR webinar 2019Ray Poynter
 
Multi-layers Convolutional Neural Network for Tweet Sentiment Classification
Multi-layers Convolutional Neural Network for Tweet Sentiment ClassificationMulti-layers Convolutional Neural Network for Tweet Sentiment Classification
Multi-layers Convolutional Neural Network for Tweet Sentiment ClassificationNurfadhlina Mohd Sharef
 
PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"
PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"
PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"CS, NcState
 
Authoring system of drill & practice elearning modules for hearing impaired s...
Authoring system of drill & practice elearning modules for hearing impaired s...Authoring system of drill & practice elearning modules for hearing impaired s...
Authoring system of drill & practice elearning modules for hearing impaired s...ijcsit
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsValery Tkachenko
 
IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...
IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...
IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...IRJET Journal
 
IRJET- Emotion Recognition using Facial Expression
IRJET-  	  Emotion Recognition using Facial ExpressionIRJET-  	  Emotion Recognition using Facial Expression
IRJET- Emotion Recognition using Facial ExpressionIRJET Journal
 
A cluster-based analysis to diagnose students’ learning achievements
A cluster-based analysis to diagnose students’ learning achievementsA cluster-based analysis to diagnose students’ learning achievements
A cluster-based analysis to diagnose students’ learning achievementsEADTU
 
Lab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docx
Lab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docxLab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docx
Lab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docxcroysierkathey
 

Similar to Towards an Active Learning System for Company Name Disambiguation in Microblog Streams (20)

Training language models to follow instructions with human feedback (Instruct...
Training language models to follow instructions with human feedback (Instruct...Training language models to follow instructions with human feedback (Instruct...
Training language models to follow instructions with human feedback (Instruct...
 
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
 
Building an Effective Skills Strategy for Spain – Consultation Workshop with ...
Building an Effective Skills Strategy for Spain – Consultation Workshop with ...Building an Effective Skills Strategy for Spain – Consultation Workshop with ...
Building an Effective Skills Strategy for Spain – Consultation Workshop with ...
 
Mogol johnston 2010_ncea_dif
Mogol johnston 2010_ncea_difMogol johnston 2010_ncea_dif
Mogol johnston 2010_ncea_dif
 
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
 
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
 
Machine learning: A Walk Through School Exams
Machine learning: A Walk Through School ExamsMachine learning: A Walk Through School Exams
Machine learning: A Walk Through School Exams
 
QUARTER-3.-LESSON-2-in-RESEARCH-II.pptx
QUARTER-3.-LESSON-2-in-RESEARCH-II.pptxQUARTER-3.-LESSON-2-in-RESEARCH-II.pptx
QUARTER-3.-LESSON-2-in-RESEARCH-II.pptx
 
Venkat Sastry talking on MoLE Project A Global Technology Initiative
Venkat Sastry talking on MoLE ProjectA Global Technology InitiativeVenkat Sastry talking on MoLE ProjectA Global Technology Initiative
Venkat Sastry talking on MoLE Project A Global Technology Initiative
 
Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...
Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...
Feature Analysis for Affect Recognition Supporting Task Sequencing in Adaptiv...
 
Introduction to machine learning - Ray Poynter - NewMR webinar 2019
Introduction to machine learning  - Ray Poynter - NewMR webinar 2019Introduction to machine learning  - Ray Poynter - NewMR webinar 2019
Introduction to machine learning - Ray Poynter - NewMR webinar 2019
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Multi-layers Convolutional Neural Network for Tweet Sentiment Classification
Multi-layers Convolutional Neural Network for Tweet Sentiment ClassificationMulti-layers Convolutional Neural Network for Tweet Sentiment Classification
Multi-layers Convolutional Neural Network for Tweet Sentiment Classification
 
PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"
PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"
PROMISE 2011: "Detecting Bug Duplicate Reports through Locality of Reference"
 
Authoring system of drill & practice elearning modules for hearing impaired s...
Authoring system of drill & practice elearning modules for hearing impaired s...Authoring system of drill & practice elearning modules for hearing impaired s...
Authoring system of drill & practice elearning modules for hearing impaired s...
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
 
IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...
IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...
IRJET- Convolutional Neural Networks for Automatic Classification of Diabetic...
 
IRJET- Emotion Recognition using Facial Expression
IRJET-  	  Emotion Recognition using Facial ExpressionIRJET-  	  Emotion Recognition using Facial Expression
IRJET- Emotion Recognition using Facial Expression
 
A cluster-based analysis to diagnose students’ learning achievements
A cluster-based analysis to diagnose students’ learning achievementsA cluster-based analysis to diagnose students’ learning achievements
A cluster-based analysis to diagnose students’ learning achievements
 
Lab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docx
Lab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docxLab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docx
Lab Assignment 5Correlations and Chi-Squares in SPSS1. Tes.docx
 

More from Damiano Spina

A Formal Account of Effectiveness Evaluation and Ranking Fusion
A Formal Account of Effectiveness Evaluation and Ranking FusionA Formal Account of Effectiveness Evaluation and Ranking Fusion
A Formal Account of Effectiveness Evaluation and Ranking FusionDamiano Spina
 
SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...
SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...
SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...Damiano Spina
 
Learning Similarity Functions for Topic Detection in Online Reputation Monito...
Learning Similarity Functions for Topic Detection in Online Reputation Monito...Learning Similarity Functions for Topic Detection in Online Reputation Monito...
Learning Similarity Functions for Topic Detection in Online Reputation Monito...Damiano Spina
 
ORMA: A Semi-Automatic Tool for Online Reputation Monitoring in Twitter
ORMA: A Semi-Automatic Tool for Online Reputation Monitoring in TwitterORMA: A Semi-Automatic Tool for Online Reputation Monitoring in Twitter
ORMA: A Semi-Automatic Tool for Online Reputation Monitoring in TwitterDamiano Spina
 
Online Reputation Monitoring in Twitter from an Information Access Perspective
Online Reputation Monitoring in Twitter from an Information Access PerspectiveOnline Reputation Monitoring in Twitter from an Information Access Perspective
Online Reputation Monitoring in Twitter from an Information Access PerspectiveDamiano Spina
 
UNED Online Reputation Monitoring Team at RepLab 2013
UNED Online Reputation Monitoring Team at RepLab 2013UNED Online Reputation Monitoring Team at RepLab 2013
UNED Online Reputation Monitoring Team at RepLab 2013Damiano Spina
 
Identifying Entity Aspects in Microblog Posts
Identifying Entity Aspects in Microblog PostsIdentifying Entity Aspects in Microblog Posts
Identifying Entity Aspects in Microblog PostsDamiano Spina
 
Towards Real-Time Summarization of Scheduled Events from Twitter Streams
Towards Real-Time Summarization of Scheduled Events from Twitter StreamsTowards Real-Time Summarization of Scheduled Events from Twitter Streams
Towards Real-Time Summarization of Scheduled Events from Twitter StreamsDamiano Spina
 
A Corpus for Entity Profiling in Microblog Posts
A Corpus for Entity Profiling in Microblog PostsA Corpus for Entity Profiling in Microblog Posts
A Corpus for Entity Profiling in Microblog PostsDamiano Spina
 
Filter keywords and majority class strategies for company name disambiguation...
Filter keywords and majority class strategies for company name disambiguation...Filter keywords and majority class strategies for company name disambiguation...
Filter keywords and majority class strategies for company name disambiguation...Damiano Spina
 
Evaluación de sistemas de monitorización de contenidos generados por usuarios
Evaluación de sistemas de monitorización de contenidos generados por usuariosEvaluación de sistemas de monitorización de contenidos generados por usuarios
Evaluación de sistemas de monitorización de contenidos generados por usuariosDamiano Spina
 
Caracterización de una entidad basada en opiniones: un estudio de caso
Caracterización de una entidad basada en opiniones: un estudio de casoCaracterización de una entidad basada en opiniones: un estudio de caso
Caracterización de una entidad basada en opiniones: un estudio de casoDamiano Spina
 

More from Damiano Spina (12)

A Formal Account of Effectiveness Evaluation and Ranking Fusion
A Formal Account of Effectiveness Evaluation and Ranking FusionA Formal Account of Effectiveness Evaluation and Ranking Fusion
A Formal Account of Effectiveness Evaluation and Ranking Fusion
 
SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...
SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...
SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents @ ...
 
Learning Similarity Functions for Topic Detection in Online Reputation Monito...
Learning Similarity Functions for Topic Detection in Online Reputation Monito...Learning Similarity Functions for Topic Detection in Online Reputation Monito...
Learning Similarity Functions for Topic Detection in Online Reputation Monito...
 
ORMA: A Semi-Automatic Tool for Online Reputation Monitoring in Twitter
ORMA: A Semi-Automatic Tool for Online Reputation Monitoring in TwitterORMA: A Semi-Automatic Tool for Online Reputation Monitoring in Twitter
ORMA: A Semi-Automatic Tool for Online Reputation Monitoring in Twitter
 
Online Reputation Monitoring in Twitter from an Information Access Perspective
Online Reputation Monitoring in Twitter from an Information Access PerspectiveOnline Reputation Monitoring in Twitter from an Information Access Perspective
Online Reputation Monitoring in Twitter from an Information Access Perspective
 
UNED Online Reputation Monitoring Team at RepLab 2013
UNED Online Reputation Monitoring Team at RepLab 2013UNED Online Reputation Monitoring Team at RepLab 2013
UNED Online Reputation Monitoring Team at RepLab 2013
 
Identifying Entity Aspects in Microblog Posts
Identifying Entity Aspects in Microblog PostsIdentifying Entity Aspects in Microblog Posts
Identifying Entity Aspects in Microblog Posts
 
Towards Real-Time Summarization of Scheduled Events from Twitter Streams
Towards Real-Time Summarization of Scheduled Events from Twitter StreamsTowards Real-Time Summarization of Scheduled Events from Twitter Streams
Towards Real-Time Summarization of Scheduled Events from Twitter Streams
 
A Corpus for Entity Profiling in Microblog Posts
A Corpus for Entity Profiling in Microblog PostsA Corpus for Entity Profiling in Microblog Posts
A Corpus for Entity Profiling in Microblog Posts
 
Filter keywords and majority class strategies for company name disambiguation...
Filter keywords and majority class strategies for company name disambiguation...Filter keywords and majority class strategies for company name disambiguation...
Filter keywords and majority class strategies for company name disambiguation...
 
Evaluación de sistemas de monitorización de contenidos generados por usuarios
Evaluación de sistemas de monitorización de contenidos generados por usuariosEvaluación de sistemas de monitorización de contenidos generados por usuarios
Evaluación de sistemas de monitorización de contenidos generados por usuarios
 
Caracterización de una entidad basada en opiniones: un estudio de caso
Caracterización de una entidad basada en opiniones: un estudio de casoCaracterización de una entidad basada en opiniones: un estudio de caso
Caracterización de una entidad basada en opiniones: un estudio de caso
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 

Towards an Active Learning System for Company Name Disambiguation in Microblog Streams

  • 1. Active Learning for Filtering Maria-Hendrike Peetz, Damiano Spina, Maarten de Rijke, Julio Gonzalo ISLA, Informatics Institute University of Amsterdam http://ilps.science.uva.nl Lessons Learnt UvA and UNED at RepLab 2013 • Filtering models worked well on the trial data • Active learning with 1% improves weak classifiers • There was not enough training data for language dependent training. NLP & IR Group UNED http://nlp.uned.es Combine expert knowledge with computational effort by asking the right questions. Annotating few examples is ok. @nokia: My phone stopped working. Data changes over time. @microsoft 2012 2013 1. Feature representation 2. Model Training/Update 4. Candidate Selection 5. User Feedback Training Dataset Model 3. Classification Test Dataset 1. Feature representation  new labeled instance Feature representation Model Training/Update Candidate SelectionClassification Sample tweets, annotate them, and update the model. With a decent baseline, annotating 15 tweets per entity is enough. 1. Analysts need to keep up to date anyway. 2. They have the ultimate responsibility for the result. Results Margin/uncertainty sampling: Selecting examples close to the margin means sampling examples where the classifier is less confident. • Naive Bayes • Tweet metadata (always) • Entity linking of the tweets (BoE) • Retrain NB with every new instance. • Higher weight to newly annotated instances. Select 1% of tweets for annotation: On average that are 15 tweets per entity, in the language dependent case: 10 English, 5 Spanish 4.2 Submitted runs Table 5 shows the results of our o cial runs with respect to accuracy, reliability (R), sensitivity (S), and F(R,S), the F1 Measure of Reliability and Sensitivity. We can see that our baselines as well as the best performing system performs worse than a simple baseline. The provided baseline selects the class of an instance based on the class of the closest (using Jaccard similarity) instance in the training set. We can, however, see some interesting improvements. For a start, active learn- ing helps. We can see that the use of 1% annotation improves the results for all four metrics. Secondly, building a language-independent model performs better than building two language-dependent models per entity. Finally, we can see that the class imbalance also holds in the test set, as assigning the majority class for strongly skewed data performs much better than using active learning alone. Table 4. Results of the o cial runs. run id accuracy R S F(R,S) baseline 0.8714 0.4902 0.3200 0.3255 UvA UNED filtering 1 0.2785 0.1635 0.1258 0.0730 UvA UNED filtering 2 0.2847 0.2050 0.1441 0.0928 UvA UNED filtering 3 0.5657 0.2040 0.2369 0.1449 UvA UNED filtering 4 0.6360 0.2386 0.2782 0.1857 UvA UNED filtering 5 0.7745 0.6486 0.1833 0.1737 UvA UNED filtering 6 0.8155 0.6780 0.2187 0.2083 Table 5. Results of the o cial runs. run id accuracy R S F(R,S) Jaccard 0.8714 0.4902 0.3200 0.3255 BoE+lang. independent 0.2785 0.1635 0.1258 0.0730 BoE+lang. dependent 0.2847 0.2050 0.1441 0.0928 BoE+lang. independent + AL 0.5657 0.2040 0.2369 0.1449 BoE+lang. dependent + AL 0.6360 0.2386 0.2782 0.1857 BoE+lang. independent + AL + majority 0.7745 0.6486 0.1833 0.1737 BoE+lang. dependent + AL + majority 0.8155 0.6780 0.2187 0.2083 5 Conclusions We have presented an active learning approach to company name disambigua- tion in tweets. For this classification task, we found that active learning does ? ? ? ? ? 1 1 0 0 Task: Identify tweets that are relevant to a company. M.H.Peetz@uva.nl damiano@lsi.uned.es