Abstract:
Transfer learning has become the go-to approach for many natural language processing (NLP) tasks because it achieves state-of-the-art (SOTA) results with significantly less labeled data than other approaches. In this talk we will present an overview of techniques used in modern transfer learning, such as the 1cycle policy, discriminative learning rates, and the label smoothing loss function. We will demonstrate this approach on a multi-class document classification task using the fast.ai library. Finally, we will share some tips from applying this approach to real-world problems.
Speaker Bio:
Pradeep currently works as a Data Scientist at Neudesic. He started his career as a Research Analyst at Banner Alzheimer’s Institute, working toward finding some of the earliest biomarkers of Alzheimer’s disease by applying machine learning techniques to genetic and neuro-imaging data. He holds a Master’s degree in Computational Biosciences from Arizona State University. In his spare time, he likes to climb to the tops of mountains or rappel down into canyons.
Real-world Document Classification with Transfer Learning
1. October 2019
Desert Data Science User Group
Real-world
Document Classification
with Transfer Learning
Pradeep Thiyyagura
Data Scientist
Neudesic
2. "Neural networks, a beautiful biologically-inspired programming paradigm which
enables a computer to learn from observational data."
- Michael Nielsen
3. Goals
• Provide an overview of Machine Learning and Natural Language Processing
• Get introduced to Transfer Learning and the fast.ai library
• Build an algorithm to classify 20K news articles into 20 categories
• Share some tips from implementing text classification techniques in the real world
4. Why?
• There is a lot of unstructured text data in the world
• There is tremendous value to be extracted
• We can achieve SOTA results with fewer examples to train on
19. Natural Language Processing
NLP is a branch of artificial intelligence that deals with the interaction between computers and humans using natural language.
22. Fast.ai - Making neural nets uncool again
• Python-based toolkit
• Built on top of PyTorch
• Tools for classification, regression, time series, collaborative filtering, data preparation, and interpretation
• Implements most modern transfer learning techniques
• Easy to use
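One of those transfer learning techniques is discriminative learning rates: earlier layer groups, which hold general language features, get smaller rates than the task-specific head. Here is a minimal pure-Python sketch of geometrically spaced rates; the helper name is hypothetical and this is not fastai's actual code, though fastai expresses the same idea by passing `slice(lr_min, lr_max)` to its fit methods.

```python
def discriminative_lrs(lr_min, lr_max, num_groups):
    """Return one learning rate per layer group, geometrically spaced.

    Early groups (general, pretrained features) get lr_min; the final
    group (the task-specific head) gets lr_max, so fine-tuning disturbs
    the pretrained layers less than the new classifier layers.
    Hypothetical helper for illustration, not fastai's API.
    """
    if num_groups == 1:
        return [lr_max]
    ratio = (lr_max / lr_min) ** (1 / (num_groups - 1))
    return [lr_min * ratio ** i for i in range(num_groups)]

# Three layer groups spanning 1e-5 (embeddings) up to 1e-3 (classifier head):
print(discriminative_lrs(1e-5, 1e-3, 3))  # geometrically spaced: 1e-5, 1e-4, 1e-3
```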
26. Label Smoothing
Label smoothing helps the model train around mislabeled data and consequently improves its robustness and performance.
Training labels become 1-β for cat and β for not cat.
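The smoothed targets described above can be sketched in plain Python (no framework; `smooth_labels` is a hypothetical helper, and β = 0.1 is just an illustrative value):

```python
def smooth_labels(true_class, num_classes, beta=0.1):
    """Return a label-smoothed target distribution.

    The true class gets probability 1 - beta; the remaining beta is
    spread evenly over the other classes, so a mislabeled example no
    longer pushes the model toward full confidence in a wrong answer.
    """
    off_value = beta / (num_classes - 1)
    target = [off_value] * num_classes
    target[true_class] = 1.0 - beta
    return target

# Binary cat / not-cat example from the slide, with beta = 0.1:
print(smooth_labels(0, 2))  # [0.9, 0.1] -> 1 - beta for cat, beta for not cat
```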
27. Practical Tips
• If you can’t solve a problem analytically, solve it iteratively
• Use what the model already knows and correct its predictions
• If the results are bad, is it your model or are your labels bad?
• Good annotation teams are small. Collaborate with them regularly
• Data is the new software. Iterate on your code and data
• If things don’t work, don’t be afraid to scale down
29. References and Resources
• Practical Deep Learning for Coders, fastai
• But What Is a Neural Network?, 3Blue1Brown
• Visualizing a Neural Machine Translation Model, Jay Alammar
• Neural Networks and Deep Learning, Michael Nielsen
• Building New NLP Solutions with spaCy and Prodigy, Matthew Honnibal
• Software 2.0, Andrej Karpathy
• PyData Conference Talks, PyData
Training ResNet and Inception architectures on the ImageNet dataset with the standard learning rate policy (blue curve) versus a 1cycle policy that displays super-convergence. This illustrates that deep neural networks can be trained much faster (20 versus 100 epochs) than with standard training methods.
https://arxiv.org/pdf/1803.09820.pdf
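The 1cycle policy from that paper can be sketched as a plain-Python schedule. This is a simplified linear warmup/decay version of Smith's policy; the function name and parameter defaults are illustrative, and fastai's `fit_one_cycle` is a refined implementation that also cycles momentum in the opposite direction.

```python
def one_cycle_lr(step, total_steps, lr_max, div=25.0, pct_warmup=0.3):
    """Learning rate at `step` under a simple 1cycle schedule.

    Linearly warms up from lr_max/div to lr_max over the first
    pct_warmup of training, then linearly decays back down. The large
    mid-training rates act as a regularizer and enable the fast
    convergence ("super-convergence") shown in the figure above.
    """
    lr_min = lr_max / div
    warm_steps = int(total_steps * pct_warmup)
    if step < warm_steps:
        frac = step / warm_steps
        return lr_min + frac * (lr_max - lr_min)
    frac = (step - warm_steps) / (total_steps - warm_steps)
    return lr_max - frac * (lr_max - lr_min)

# The peak learning rate is reached at the end of the warmup phase:
schedule = [one_cycle_lr(s, 100, 1e-3) for s in range(100)]
print(max(schedule))  # 0.001, reached at step 30
```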
https://www.analyticsinhr.com/blog/natural-language-processing-revolutionize-human-resources/
The ultimate objective of NLP is to read, decipher, understand, and make sense of the human languages in a manner that is valuable.