SlideShare a Scribd company logo
Deep Neural Networks in Text Classification
using Active Learning
Mirsaeid Abolghasemi
San Jose State University
CMPE-297 Sec 49 - Advanced Deep Learning - Short story assignment
Fall 2020
1 Introduction
Three main scenarios for Active Learning:
1. Pool-based: the learner has an availability to the closed collection of unlabeled cases,
known as the pool.
2. Stream-based: the learner has the option to hold or release one case at a time.
3. Membership query synthesis: The learner makes the labeling of new artificial
cases. When the pool-based setup does not work on a single case, it is called batch-
Mode Active Learning on a batch of cases.
1 Introduction (Cont.)
Interestingly, while NNs are common, there are few researchers in the field of NLP and
fewer in the case of text classification on NN-based active learning.
The following may be the reasons for it:
1. Most DL models need a huge amount of data, which contrasts strongly with Active
Learning which expects small datasets as necessary.
2. The total Active Learning approaches focused on the generation of creating data,
which is inevitably much more complicated for text than, for instance, images, in
which data augmentation is widely used in classification tasks.
3. NNs lack uncertain information, which makes the use of a leading class of query
approaches more difficult.
2 Active Learning
There are three steps the Active Learning process which is:
● Step 1: The oracle sends a request for unlabeled instances to the active learner (query)
● Step 2: Active Learner selects and passes the unlabeled instance to the oracle(based on
the selected query strategy.)
● Step 3: The oracle labels these instances and returns back to the active learner (update).
2 Active Learning (Cont.)
● The key parts of Active Learner which are Model, Query strategy, and Stopping
criterion (optional).
● The main part for Active Learner is the query strategy which is uncertainty-based.
2.1 Query Strategies
The most common query strategies of Active Learning are classified based on the input
information of a strategy.
The input information for this study is classified into four categories:
1. Random
2. Data-Based
3. Model-Based
4. Prediction-Based
2.1 Query Strategies (Cont.)
Categorization of query strategies
2.1 Query Strategies (Cont.)
Data-based: Data-based strategies have the lowest level of knowledge, i.e. they only
operate on the raw input data and optionally the labels of the labeled pool. It is categorized
into:
1. Strategies: Strategies rely on data-uncertainty. It may use the input information
about:
a. Data distribution
b. Label distribution
c. Label correlation.
2. Representativeness: geometrically compact a collection of points, requires lesser
descriptive instances to describe the whole specifications.
2.1 Query Strategies (Cont.)
Ensembles: an ensemble is a combination of the outcome of some other strategies by a
query strategy.
1. Ensembles consist of basic query strategies
2. Ensembles may be hybrids, for instance, a combination of multiple categories of
query strategies. Also, the outcome of ensembles typically depends on the conflict
between the individual classifiers.
2.2 Neural-Network-Based Active Learning
For this part, it will be discussed that neural networks in Active Learning applications are
not more common and why. This will be focused on NLP techniques.
Two key themes can be applied to this:
1. Uncertainty estimation in NNs
2. The contrast of NNs requiring between big data and Active Learning dealing with
small data.
3 Active Learning for Text Classification
● The classical methods implement the representation of the bag-of-words (BoW).
● BoW representations are high-dimensional and sparse.
● The following new representation in word embeddings replaced BoW
representations:
○ Word2vec
○ GloVe
○ fastText
3.2 Text Classification for Active Learning
● Classic Active Learning for text classification was heavily focused on prediction-
uncertainty and ensembling.
● Popular models contained Support Vector Machines(SVMs), Naive Bayes, logistic
regression, and neural networks.
● However, Olsson has covered a large ensemble-based Active Learning for NLP in
detail
● According to recent research, no prior survey covered classical Active Learning for
text classification.
● Concerning current text classification NN-based Active Learning, the applicable
models are mainly CNN- and LSTM-based deep architectures.
3.3 Commonalities and Text classification recent work on Active
Learning:
Models in Table 1:
● Naive Bayes (NB)
● Support Vector Machine (SVM)
● k-Nearest Neighbours (kNN)
● Convolutional Neural Network (CNN)
● [Bidirectional] Long Short-Term Memory ([Bi]LSTM)
● FastText.zip (FTZ)
● Universal Language Model Fine-tuning (ULMFiT).
Query strategies in Table 1:
● Least confidence (LC)
● Closest-to-hyperplane (CTH)
● expected gradient length (EGL)
Based on the table, It is clear that a vast majority of such query
strategies belong to the prediction-uncertainty and
disagreement- based sub-classes.
The short keys of a collection
of widely-used text classification datasets.
The column "Type" shows the classification setting:
● B = binary
● MC = multi-class
● ML = multi-class multi-label
5 Outcomes of the Research and Conclusions:
Uncertainty Estimates in Neural Networks: In collaboration with NN models,
uncertainty-based strategies were successfully utilized, and the most critical aspect of
query strategies in the latest NN-based Active Learning has been discovered. Because of
inaccurate uncertainty estimates, or restricted scalability, the uncertainty in NNs is still
challenging.
Representations: The implementation of NLP text representations has progressed from
bag-of-words to text embedding. These representations bring numerous benefits, including
non-sparse vectors, disambiguation capabilities, and accuracy improvements for several
tasks.
5 Conclusions (Cont.)
Small Data DNNs: In large datasets, DL methods are typically used. Active Learning
plans to keep the data collection as small as necessary, though. Small data sets were
explained why they could challenge DNNs and also DNN- based Active Learning as a
direct result.
Learning to Learn: There are lots of query strategies, which were classified non-
exhaustively. This raises the issue of selecting the best strategy. Several variables, such as
data, model, or task, depending on the correct choice and which vary between the various
processes during the Active Learning process. This means that learning to learn (or meta-
learn) has become popular and can be used to learn the best option, or also to learn query
strategies in general.
6 Reference:
This presentation is a short story of the following paper:
● C. Schröder and A. Niekler, “A Survey of Active Learning for Text Classification
using Deep Neural Networks,” arXiv.org, August 17, 2020. [Online]. Available:
https://arxiv.org/abs/2008.07267 (Accessed: October 05, 2020).

More Related Content

What's hot

Deep learning for medical imaging
Deep learning for medical imagingDeep learning for medical imaging
Deep learning for medical imaging
geetachauhan
 
Chapter 10 Anomaly Detection
Chapter 10 Anomaly DetectionChapter 10 Anomaly Detection
Chapter 10 Anomaly Detection
Khalid Elshafie
 
Statistics - ArgMax Equation
Statistics - ArgMax EquationStatistics - ArgMax Equation
Statistics - ArgMax Equation
Andrew Ferlitsch
 
Meta-Learning Presentation
Meta-Learning PresentationMeta-Learning Presentation
Meta-Learning Presentation
AkshayaNagarajan10
 
Introduction to image processing-Class Notes
Introduction to image processing-Class NotesIntroduction to image processing-Class Notes
Introduction to image processing-Class Notes
Dr.YNM
 
Installation guide for Latex and MOODLE
Installation guide for Latex and MOODLEInstallation guide for Latex and MOODLE
Installation guide for Latex and MOODLE
abigail4894
 
Image Degradation & Resoration
Image Degradation & ResorationImage Degradation & Resoration
Image Degradation & Resoration
Sanjay Saha
 
Learning loss for active learning
Learning loss for active learningLearning loss for active learning
Learning loss for active learning
NAVER Engineering
 
Edge detection of video using matlab code
Edge detection of video using matlab codeEdge detection of video using matlab code
Edge detection of video using matlab codeBhushan Deore
 
Tuning learning rate
Tuning learning rateTuning learning rate
Tuning learning rate
Jamie (Taka) Wang
 
Morphological Image Processing
Morphological Image ProcessingMorphological Image Processing
Morphological Image Processing
kumari36
 
ML_Unit_2_Part_A
ML_Unit_2_Part_AML_Unit_2_Part_A
ML_Unit_2_Part_A
Srimatre K
 
Learning Vector Quantization LVQ
Learning Vector Quantization LVQLearning Vector Quantization LVQ
Learning Vector Quantization LVQESCOM
 
Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.
Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.
Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.A.Tuğsan İşiaçık Çolak
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
Usama Fayyaz
 
Image denoising
Image denoising Image denoising
Image denoising
umarjamil10000
 
Naive Bayes Presentation
Naive Bayes PresentationNaive Bayes Presentation
Naive Bayes Presentation
Md. Enamul Haque Chowdhury
 
Training Neural Networks
Training Neural NetworksTraining Neural Networks
Training Neural Networks
Databricks
 
BASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROU
BASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROUBASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROU
BASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROU
anjunarayanan
 

What's hot (20)

Deep learning for medical imaging
Deep learning for medical imagingDeep learning for medical imaging
Deep learning for medical imaging
 
Chapter 10 Anomaly Detection
Chapter 10 Anomaly DetectionChapter 10 Anomaly Detection
Chapter 10 Anomaly Detection
 
Statistics - ArgMax Equation
Statistics - ArgMax EquationStatistics - ArgMax Equation
Statistics - ArgMax Equation
 
Meta-Learning Presentation
Meta-Learning PresentationMeta-Learning Presentation
Meta-Learning Presentation
 
Introduction to image processing-Class Notes
Introduction to image processing-Class NotesIntroduction to image processing-Class Notes
Introduction to image processing-Class Notes
 
Installation guide for Latex and MOODLE
Installation guide for Latex and MOODLEInstallation guide for Latex and MOODLE
Installation guide for Latex and MOODLE
 
Image Degradation & Resoration
Image Degradation & ResorationImage Degradation & Resoration
Image Degradation & Resoration
 
Point Processing
Point ProcessingPoint Processing
Point Processing
 
Learning loss for active learning
Learning loss for active learningLearning loss for active learning
Learning loss for active learning
 
Edge detection of video using matlab code
Edge detection of video using matlab codeEdge detection of video using matlab code
Edge detection of video using matlab code
 
Tuning learning rate
Tuning learning rateTuning learning rate
Tuning learning rate
 
Morphological Image Processing
Morphological Image ProcessingMorphological Image Processing
Morphological Image Processing
 
ML_Unit_2_Part_A
ML_Unit_2_Part_AML_Unit_2_Part_A
ML_Unit_2_Part_A
 
Learning Vector Quantization LVQ
Learning Vector Quantization LVQLearning Vector Quantization LVQ
Learning Vector Quantization LVQ
 
Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.
Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.
Different Tools to Detect and Monitor Oil Spills Aerial Observation Tech.
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
 
Image denoising
Image denoising Image denoising
Image denoising
 
Naive Bayes Presentation
Naive Bayes PresentationNaive Bayes Presentation
Naive Bayes Presentation
 
Training Neural Networks
Training Neural NetworksTraining Neural Networks
Training Neural Networks
 
BASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROU
BASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROUBASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROU
BASICS OF DIGITAL IMAGE PROCESSING,MARIA PETROU
 

Similar to Deep Neural Networks in Text Classification using Active Learning

Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...
BaoTramDuong2
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
rathnaarul
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
When deep learners change their mind learning dynamics for active learning
When deep learners change their mind  learning dynamics for active learningWhen deep learners change their mind  learning dynamics for active learning
When deep learners change their mind learning dynamics for active learning
Devansh16
 
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
ijsc
 
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
ijsc
 
Paper id 312201523
Paper id 312201523Paper id 312201523
Paper id 312201523
IJRAT
 
Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...
Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...
Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...
Institute of Contemporary Sciences
 
95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf
95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf
95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf
Daewoo Enginnering & Construction
 
NLP Techniques for Text Classification.docx
NLP Techniques for Text Classification.docxNLP Techniques for Text Classification.docx
NLP Techniques for Text Classification.docx
KevinSims18
 
METHODS FOR INCREMENTAL LEARNING: A SURVEY
METHODS FOR INCREMENTAL LEARNING: A SURVEYMETHODS FOR INCREMENTAL LEARNING: A SURVEY
METHODS FOR INCREMENTAL LEARNING: A SURVEY
IJDKP
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
Adetimehin Oluwasegun Matthew
 
Benchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPBenchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLP
Yury Kashnitsky
 
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
ijcsa
 
Semi-supervised Learning Survey - 20 years of evaluation
Semi-supervised Learning Survey - 20 years of evaluationSemi-supervised Learning Survey - 20 years of evaluation
Semi-supervised Learning Survey - 20 years of evaluation
subarna89
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining Algorithms
IJERA Editor
 
Natural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 ReviewNatural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 Review
changedaeoh
 
Multi label text classification
Multi label text classificationMulti label text classification
Multi label text classificationraghavr186
 
Dealing with Data Scarcity in Natural Language Processing - Belgium NLP Meetup
Dealing with Data Scarcity in Natural Language Processing - Belgium NLP MeetupDealing with Data Scarcity in Natural Language Processing - Belgium NLP Meetup
Dealing with Data Scarcity in Natural Language Processing - Belgium NLP Meetup
Yves Peirsman
 
Graph Neural Prompting with Large Language Models.pptx
Graph Neural Prompting with Large Language Models.pptxGraph Neural Prompting with Large Language Models.pptx
Graph Neural Prompting with Large Language Models.pptx
ssuser2624f71
 

Similar to Deep Neural Networks in Text Classification using Active Learning (20)

Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
When deep learners change their mind learning dynamics for active learning
When deep learners change their mind  learning dynamics for active learningWhen deep learners change their mind  learning dynamics for active learning
When deep learners change their mind learning dynamics for active learning
 
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
 
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
 
Paper id 312201523
Paper id 312201523Paper id 312201523
Paper id 312201523
 
Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...
Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...
Deep Attention Model for Triage of Emergency Department Patients - Djordje Gl...
 
95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf
95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf
95. A State-of-the-Art Survey on Deep Learning Theory and Archtectures.pdf
 
NLP Techniques for Text Classification.docx
NLP Techniques for Text Classification.docxNLP Techniques for Text Classification.docx
NLP Techniques for Text Classification.docx
 
METHODS FOR INCREMENTAL LEARNING: A SURVEY
METHODS FOR INCREMENTAL LEARNING: A SURVEYMETHODS FOR INCREMENTAL LEARNING: A SURVEY
METHODS FOR INCREMENTAL LEARNING: A SURVEY
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Benchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPBenchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLP
 
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
 
Semi-supervised Learning Survey - 20 years of evaluation
Semi-supervised Learning Survey - 20 years of evaluationSemi-supervised Learning Survey - 20 years of evaluation
Semi-supervised Learning Survey - 20 years of evaluation
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining Algorithms
 
Natural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 ReviewNatural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 Review
 
Multi label text classification
Multi label text classificationMulti label text classification
Multi label text classification
 
Dealing with Data Scarcity in Natural Language Processing - Belgium NLP Meetup
Dealing with Data Scarcity in Natural Language Processing - Belgium NLP MeetupDealing with Data Scarcity in Natural Language Processing - Belgium NLP Meetup
Dealing with Data Scarcity in Natural Language Processing - Belgium NLP Meetup
 
Graph Neural Prompting with Large Language Models.pptx
Graph Neural Prompting with Large Language Models.pptxGraph Neural Prompting with Large Language Models.pptx
Graph Neural Prompting with Large Language Models.pptx
 

Recently uploaded

Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
2023240532
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 

Recently uploaded (20)

Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 

Deep Neural Networks in Text Classification using Active Learning

  • 1. Deep Neural Networks in Text Classification using Active Learning Mirsaeid Abolghasemi San Jose State University CMPE-297 Sec 49 - Advanced Deep Learning - Short story assignment Fall 2020
  • 2. 1 Introduction Three main scenarios for Active Learning: 1. Pool-based: the learner has an availability to the closed collection of unlabeled cases, known as the pool. 2. Stream-based: the learner has the option to hold or release one case at a time. 3. Membership query synthesis: The learner makes the labeling of new artificial cases. When the pool-based setup does not work on a single case, it is called batch- Mode Active Learning on a batch of cases.
  • 3. 1 Introduction (Cont.) Interestingly, while NNs are common, there are few researchers in the field of NLP and fewer in the case of text classification on NN-based active learning. The following may be the reasons for it: 1. Most DL models need a huge amount of data, which contrasts strongly with Active Learning which expects small datasets as necessary. 2. The total Active Learning approaches focused on the generation of creating data, which is inevitably much more complicated for text than, for instance, images, in which data augmentation is widely used in classification tasks. 3. NNs lack uncertain information, which makes the use of a leading class of query approaches more difficult.
  • 4. 2 Active Learning There are three steps the Active Learning process which is: ● Step 1: The oracle sends a request for unlabeled instances to the active learner (query) ● Step 2: Active Learner selects and passes the unlabeled instance to the oracle(based on the selected query strategy.) ● Step 3: The oracle labels these instances and returns back to the active learner (update).
  • 5. 2 Active Learning (Cont.) ● The key parts of Active Learner which are Model, Query strategy, and Stopping criterion (optional). ● The main part for Active Learner is the query strategy which is uncertainty-based.
  • 6. 2.1 Query Strategies The most common query strategies of Active Learning are classified based on the input information of a strategy. The input information for this study is classified into four categories: 1. Random 2. Data-Based 3. Model-Based 4. Prediction-Based
  • 7. 2.1 Query Strategies (Cont.) Categorization of query strategies
  • 8. 2.1 Query Strategies (Cont.) Data-based: Data-based strategies have the lowest level of knowledge, i.e. they only operate on the raw input data and optionally the labels of the labeled pool. It is categorized into: 1. Strategies: Strategies rely on data-uncertainty. It may use the input information about: a. Data distribution b. Label distribution c. Label correlation. 2. Representativeness: geometrically compact a collection of points, requires lesser descriptive instances to describe the whole specifications.
  • 9. 2.1 Query Strategies (Cont.) Ensembles: an ensemble is a combination of the outcome of some other strategies by a query strategy. 1. Ensembles consist of basic query strategies 2. Ensembles may be hybrids, for instance, a combination of multiple categories of query strategies. Also, the outcome of ensembles typically depends on the conflict between the individual classifiers.
  • 10. 2.2 Neural-Network-Based Active Learning For this part, it will be discussed that neural networks in Active Learning applications are not more common and why. This will be focused on NLP techniques. Two key themes can be applied to this: 1. Uncertainty estimation in NNs 2. The contrast of NNs requiring between big data and Active Learning dealing with small data.
  • 11. 3 Active Learning for Text Classification ● The classical methods implement the representation of the bag-of-words (BoW). ● BoW representations are high-dimensional and sparse. ● The following new representation in word embeddings replaced BoW representations: ○ Word2vec ○ GloVe ○ fastText
  • 12. 3.2 Text Classification for Active Learning ● Classic Active Learning for text classification was heavily focused on prediction- uncertainty and ensembling. ● Popular models contained Support Vector Machines(SVMs), Naive Bayes, logistic regression, and neural networks. ● However, Olsson has covered a large ensemble-based Active Learning for NLP in detail ● According to recent research, no prior survey covered classical Active Learning for text classification. ● Concerning current text classification NN-based Active Learning, the applicable models are mainly CNN- and LSTM-based deep architectures.
  • 13. 3.3 Commonalities and Text classification recent work on Active Learning: Models in Table 1: ● Naive Bayes (NB) ● Support Vector Machine (SVM) ● k-Nearest Neighbours (kNN) ● Convolutional Neural Network (CNN) ● [Bidirectional] Long Short-Term Memory ([Bi]LSTM) ● FastText.zip (FTZ) ● Universal Language Model Fine-tuning (ULMFiT). Query strategies in Table 1: ● Least confidence (LC) ● Closest-to-hyperplane (CTH) ● expected gradient length (EGL) Based on the table, It is clear that a vast majority of such query strategies belong to the prediction-uncertainty and disagreement- based sub-classes.
  • 14. The short keys of a collection of widely-used text classification datasets. The column "Type" shows the classification setting: ● B = binary ● MC = multi-class ● ML = multi-class multi-label
  • 15. 5 Outcomes of the Research and Conclusions: Uncertainty Estimates in Neural Networks: In collaboration with NN models, uncertainty-based strategies were successfully utilized, and the most critical aspect of query strategies in the latest NN-based Active Learning has been discovered. Because of inaccurate uncertainty estimates, or restricted scalability, the uncertainty in NNs is still challenging. Representations: The implementation of NLP text representations has progressed from bag-of-words to text embedding. These representations bring numerous benefits, including non-sparse vectors, disambiguation capabilities, and accuracy improvements for several tasks.
  • 16. 5 Conclusions (Cont.) Small Data DNNs: In large datasets, DL methods are typically used. Active Learning plans to keep the data collection as small as necessary, though. Small data sets were explained why they could challenge DNNs and also DNN- based Active Learning as a direct result. Learning to Learn: There are lots of query strategies, which were classified non- exhaustively. This raises the issue of selecting the best strategy. Several variables, such as data, model, or task, depending on the correct choice and which vary between the various processes during the Active Learning process. This means that learning to learn (or meta- learn) has become popular and can be used to learn the best option, or also to learn query strategies in general.
  • 17. 6 Reference: This presentation is a short story of the following paper: ● C. Schröder and A. Niekler, “A Survey of Active Learning for Text Classification using Deep Neural Networks,” arXiv.org, August 17, 2020. [Online]. Available: https://arxiv.org/abs/2008.07267 (Accessed: October 05, 2020).