Slides from the July Bay Area NLP Reading Group meetup covering "Boosting Named Entity Recognition with Neural Character Embeddings" by dos Santos and Guimarães
2. Announcements
Join our Slack channel!
https://bay-area-nlp-reading.slack.com/
To join, message me (Katie Bauer) on Meetup, talk to me after the meeting or
email bay.area.nlp.reading.group@gmail.com
3. Want to help out?
Present a paper you love
Demo your favorite NLP tool or library
Host a future meetup
Participate!
4. What is NER?
Extracting proper nouns and classifying them into categories
- Universally: person, location, organization
- Date/time, currencies, domain-specific
Traditional Approaches:
- gazetteers (list lookup; a toy sketch follows this slide)
- shallow parsing - ‘based in San Francisco’
Difficulties:
- Reconciling different versions of names - Noam Chomsky vs. Professor Chomsky
- Washington - person, place, collective name for US government
- May - person or month?
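To make list lookup concrete, here is a toy gazetteer tagger; the gazetteer entries and the gazetteer_tag helper are made up for illustration, not taken from the paper.
```python
# A toy gazetteer (list-lookup) tagger. Entries are made-up examples.
GAZETTEER = {
    "San Francisco": "LOCATION",
    "Noam Chomsky": "PERSON",
    "Google": "ORGANIZATION",
}

def gazetteer_tag(text):
    """Scan the text for known entity strings, longest match first."""
    hits = []
    for name, label in sorted(GAZETTEER.items(), key=lambda kv: -len(kv[0])):
        start = text.find(name)
        if start != -1:
            hits.append((name, label, start))
    return hits

print(gazetteer_tag("Google is based in San Francisco"))
# [('San Francisco', 'LOCATION', 19), ('Google', 'ORGANIZATION', 0)]
```
Note the limits this slide's "Difficulties" point at: a lookup like this cannot tell whether "May" is a person or a month.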
5. What are Convolutional Neural Nets?
1. Divide input into windows
2. Calculate some sort of summary
3. Feed that summary to next layer
4. Divide summary into windows
5. Summarize the summary
And so on and so forth (sketched in code below)
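A minimal sketch of that window-and-summarize loop as a 1-D convolution plus max-pooling in NumPy; the filter values and window sizes are arbitrary stand-ins:
```python
# Toy 1-D convolution + pooling pass illustrating "window -> summarize -> repeat".
import numpy as np

def conv1d(x, filt):
    """Slide a window of len(filt) over x and summarize each window
    with a dot product (steps 1-2 above)."""
    k = len(filt)
    return np.array([x[i:i+k] @ filt for i in range(len(x) - k + 1)])

def max_pool(x, size):
    """Summarize the summary: keep the max of each window (steps 4-5)."""
    return np.array([x[i:i+size].max() for i in range(0, len(x) - size + 1, size)])

x = np.random.randn(16)                    # toy input signal
h = conv1d(x, np.array([0.5, 1.0, 0.5]))   # layer 1: per-window summaries
h = max_pool(h, 2)                         # pool the summaries
h = conv1d(h, np.array([1.0, -1.0]))       # feed to the next layer
print(h.shape)
```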
6. What does that look like for language?
Windows are word contexts
If w_i = 'movie', then
[w_{i-2}, w_{i-1}, w_i, w_{i+1}, w_{i+2}] = [like, this, movie, very, much]
Each word w_i is a column vector (see the sketch below)
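A minimal sketch of building such a window, assuming a toy embedding table (the vectors are random stand-ins) and a <pad> token for sentence boundaries:
```python
# Build the word-context window around w_i and stack its vectors.
import numpy as np

rng = np.random.default_rng(0)
emb = {w: rng.standard_normal(4) for w in
       ["<pad>", "I", "like", "this", "movie", "very", "much"]}

def window(words, i, half=2):
    """Return [w_{i-half}, ..., w_{i+half}], padding past sentence boundaries."""
    padded = ["<pad>"] * half + words + ["<pad>"] * half
    return padded[i : i + 2 * half + 1]

sent = ["I", "like", "this", "movie", "very", "much"]
ctx = window(sent, sent.index("movie"))    # ['like', 'this', 'movie', 'very', 'much']
r = np.concatenate([emb[w] for w in ctx])  # one long vector for the whole window
print(ctx, r.shape)
```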
7. Model
Task: Given a sentence, score the likelihood of each named entity class for each word
Input:
A sentence of N words: {w_1, w_2, ..., w_{N-1}, w_N}
Each word is the concatenation of a word-level and a character-level embedding:
w_n = [w^wrd; w^wch] (sketched below)
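A minimal sketch of w_n = [w^wrd; w^wch]: the word-level part is a table lookup, and the character-level part is a small convolution-and-max over character windows (the paper's character-level approach); all weights and dimensions here are random stand-ins.
```python
# Join a word-level embedding with a character-level one built by a char CNN.
import numpy as np

rng = np.random.default_rng(0)
d_wrd, d_chr, d_wch, k = 8, 4, 6, 3          # toy sizes; k = char window width
word_emb = {"movie": rng.standard_normal(d_wrd)}
char_emb = {c: rng.standard_normal(d_chr) for c in "abcdefghijklmnopqrstuvwxyz"}
W = rng.standard_normal((d_wch, k * d_chr))  # char-convolution filter bank

def char_features(word):
    """Convolve over character windows, then max-pool to a fixed-size w^wch."""
    chars = [char_emb[c] for c in word]
    windows = [np.concatenate(chars[i:i+k]) for i in range(len(chars) - k + 1)]
    return np.max([W @ z for z in windows], axis=0)

w_n = np.concatenate([word_emb["movie"], char_features("movie")])
print(w_n.shape)  # (d_wrd + d_wch,) = (14,)
```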
8. Model
Scoring
Concatenate the word vectors in a window centered on word n to get vector r
Pass r through two neural network layers to get a score for each tag
Add the transition score A_{t,u} (the score of moving from tag t to tag u) to capture dependencies between consecutive tags
Track the best-scoring tag sequence ending in each tag (Viterbi-style dynamic programming; sketched after this slide)
Pick the most likely sequence at the end of the sentence
Optimization
The sentence score defines a conditional probability, so minimize the negative log-likelihood
Train with stochastic gradient descent and backpropagation
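A minimal Viterbi sketch for the decoding step above; the per-word emission scores and the transition matrix A are random stand-ins for the network's outputs:
```python
# Viterbi decoding: find the tag sequence with the highest total score.
import numpy as np

def viterbi(emissions, A):
    """emissions: (N, T) per-word tag scores; A[t, u]: transition score
    from tag t to tag u. Returns the highest-scoring tag sequence."""
    N, T = emissions.shape
    score = emissions[0].copy()          # best score ending in each tag
    back = np.zeros((N, T), dtype=int)   # backpointers to recover the path
    for n in range(1, N):
        cand = score[:, None] + A + emissions[n][None, :]  # (prev, cur)
        back[n] = cand.argmax(axis=0)
        score = cand.max(axis=0)
    tags = [int(score.argmax())]
    for n in range(N - 1, 0, -1):        # walk backpointers from the end
        tags.append(int(back[n][tags[-1]]))
    return tags[::-1]

rng = np.random.default_rng(0)
print(viterbi(rng.standard_normal((6, 5)), rng.standard_normal((5, 5))))
```
Rather than storing every possible tag sequence, this keeps only the best path ending in each tag at each step, which is what makes exact decoding tractable.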
9. Corpora
Portuguese
- Word embeddings initialized with three corpora
- Trained and tested on HAREM
- HAREM 1 for training, miniHAREM for test
Spanish
- Word embeddings initialized with Spanish Wikipedia
- Trained and tested on SPA CoNLL-2002
- SPA CoNLL-2002 comes with predefined training, development, and test sets
14. Takeaways
Different types of information are captured at word and character level
Prior knowledge (pretrained word embeddings) improves performance
With no prior knowledge, a bigger data set is better
15. Additional Resources
Introduction to Named Entity Recognition
https://gate.ac.uk/sale/talks/stupidpoint/diana-fb.ppt
Understanding Convolutional Neural Networks for NLP
http://www.wildml.com/2015/11/understanding-convolutional-neural-networks-for-nlp/
Implementing a CNN for Text Classification in Tensorflow
http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/