SlideShare a Scribd company logo
1 of 16
Download to read offline
Bay Area NLP
Reading Group
July 12, 2016
Announcements
Join our Slack channel!
https://bay-area-nlp-reading.slack.com/
To join, message me (Katie Bauer) on Meetup, talk to me after the meeting or
email bay.area.nlp.reading.group@gmail.com
Want to help out?
Present a paper you love
Demo your favorite NLP tool or library
Host a future meetup
Participate!
What is NER?
Extracting proper nouns and classifying into categories
- Universally: person, location, organization
- Date/time, currencies, domain-specific
Traditional Approaches:
- gazetteers (list lookup)
- shallow parsing - ‘based in San Francisco’
Difficulties:
- Reconciling different versions of names - Noam Chomsky vs. Professor Chomsky
- Washington - person, place, collective name for US government
- May - person or month?
What are Convolutional Neural Nets?
1. Divide input into windows
2. Calculate some sort of summary
3. Feed that summary to next layer
4. Divide summary into windows
5. Summarize the summary
And so on and so forth
What does that look like for language?
Windows are word contexts
If wi
= ‘movie’,
[wi-2
, wi-1
, wi
, wi+1
, wi+2
] = [like, this, movie, very, much]
Wi
is a column vector
Model
Task: Given a sentence, score the likelihood of each named entity class word for
each word
Input:
Sentence of N words
{w1
,w2
, … , wn-1
, wn
}
Words
wn
= [wwrd
,wwch
]
Model
Scoring
Concatenate all word vectors centered around word n to get vector r
Pass r through two layers of the neural network
Check transition score Aut
to see likelihood of tags given previous tags
Store all possible tag sequences
Pick most likely sequence at end of sentence
Optimization
Sentence score is conditional probability, so minimize negative log likelihood
Backpropogated stochastic gradient descent
Corpora
Portuguese
- Word embeddings initialized with three corpora
- Trained and tested on HAREM
- HAREM 1 for training, miniHAREM for test
Spanish
- Word embeddings initialized with Spanish Wikipedia
- Trained and tested on SPA CoNLL-2002
- SPA CoNLL-2002 has predivided training, development and test sets
Experiments
Comparable Architectures:
- CharWNN - WNN
- CharNN - WNN + capitalization feature + suffix feature
Experiments
State of the Art:
- AdaBoost for Spanish - ETLCMT
for Portuguese
Experiments
Portuguese by entity type
Experiments
Pretrained word embeddings vs. randomly initialized word embeddings
Takeaways
Different types of information are captured at word and character level
Prior knowledge (pretrained word embeddings) improves performance
With no prior knowledge, a bigger data set is better
Additional Resources
Introduction to Named Entity Recognition
https://gate.ac.uk/sale/talks/stupidpoint/diana-fb.ppt
Understanding Convolutional Neural Networks for NLP
http://www.wildml.com/2015/11/understanding-convolutional-neural-networks-for-nlp/
Implementing a CNN for Text Classification in Tensorflow
http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/
Thank you!
Bay Area NLP
Reading Group
July 12, 2016

More Related Content

Viewers also liked

Journal of Remote Sensing & GIS vol 7 issue 3
Journal of Remote Sensing & GIS vol 7 issue 3Journal of Remote Sensing & GIS vol 7 issue 3
Journal of Remote Sensing & GIS vol 7 issue 3STM Journals
 
keynote modelsward 2017
keynote modelsward 2017keynote modelsward 2017
keynote modelsward 2017miso_uam
 
Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007
Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007
Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007yoga wijaya
 
m2fa2 ishak Frans woordjes 2
m2fa2 ishak Frans woordjes 2 m2fa2 ishak Frans woordjes 2
m2fa2 ishak Frans woordjes 2 edmondvincent
 
m2fa2 Imed Frans woordjes 2
m2fa2 Imed Frans woordjes 2m2fa2 Imed Frans woordjes 2
m2fa2 Imed Frans woordjes 2edmondvincent
 
Bethan Turner Festival of NewMR 2017
Bethan Turner Festival of NewMR 2017Bethan Turner Festival of NewMR 2017
Bethan Turner Festival of NewMR 2017Ray Poynter
 
Percival Main Brochure.
Percival Main Brochure.Percival Main Brochure.
Percival Main Brochure.Ray Russell
 
El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...
El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...
El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...BlogsterApp Ambassador
 

Viewers also liked (12)

Derecho laboral.pptx
Derecho laboral.pptxDerecho laboral.pptx
Derecho laboral.pptx
 
Journal of Remote Sensing & GIS vol 7 issue 3
Journal of Remote Sensing & GIS vol 7 issue 3Journal of Remote Sensing & GIS vol 7 issue 3
Journal of Remote Sensing & GIS vol 7 issue 3
 
keynote modelsward 2017
keynote modelsward 2017keynote modelsward 2017
keynote modelsward 2017
 
Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007
Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007
Perbedaan Pada Page Layout Ms.Excel 2003 dan Ms.Excel 2007
 
m2fa2 ishak Frans woordjes 2
m2fa2 ishak Frans woordjes 2 m2fa2 ishak Frans woordjes 2
m2fa2 ishak Frans woordjes 2
 
m2fa2 Imed Frans woordjes 2
m2fa2 Imed Frans woordjes 2m2fa2 Imed Frans woordjes 2
m2fa2 Imed Frans woordjes 2
 
Brand guidelines
Brand guidelinesBrand guidelines
Brand guidelines
 
Bethan Turner Festival of NewMR 2017
Bethan Turner Festival of NewMR 2017Bethan Turner Festival of NewMR 2017
Bethan Turner Festival of NewMR 2017
 
Updated resume
Updated resumeUpdated resume
Updated resume
 
18rf 31170-ts-002 (flange)
18rf 31170-ts-002 (flange)18rf 31170-ts-002 (flange)
18rf 31170-ts-002 (flange)
 
Percival Main Brochure.
Percival Main Brochure.Percival Main Brochure.
Percival Main Brochure.
 
El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...
El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...
El "Cartel" de Periscope. Como distribuir tu contenido por el mundo. Por Borj...
 

Similar to Bay Area NLP Reading Group - 7.12.16

Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Rajnish Raj
 
Webinar: OpenNLP and Solr for Superior Relevance
Webinar: OpenNLP and Solr for Superior RelevanceWebinar: OpenNLP and Solr for Superior Relevance
Webinar: OpenNLP and Solr for Superior RelevanceLucidworks
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 
Content Processing Architecture and Applications - Introduction to Text Mining
Content Processing Architecture and Applications - Introduction to Text MiningContent Processing Architecture and Applications - Introduction to Text Mining
Content Processing Architecture and Applications - Introduction to Text MiningFindwise
 
Challenges in transfer learning in nlp
Challenges in transfer learning in nlpChallenges in transfer learning in nlp
Challenges in transfer learning in nlpLaraOlmosCamarena
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
A Corpus-based Approach to Tracking L2 Development
A Corpus-based Approach to Tracking L2 DevelopmentA Corpus-based Approach to Tracking L2 Development
A Corpus-based Approach to Tracking L2 DevelopmentCALPER
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataAndre Freitas
 
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf EremyanDataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyanrudolf eremyan
 
NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflowseungwoo kim
 
Natural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxNatural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxAlyaaMachi
 
NLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful inNLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful inKumari Naveen
 
lect36-tasks.ppt
lect36-tasks.pptlect36-tasks.ppt
lect36-tasks.pptHaHa501620
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingParrotAI
 
Using construction grammar in conversational systems
Using construction grammar in conversational systemsUsing construction grammar in conversational systems
Using construction grammar in conversational systemsCJ Jenkins
 
Portuguese Linguistic Tools: What, Why and How
Portuguese Linguistic Tools: What, Why and HowPortuguese Linguistic Tools: What, Why and How
Portuguese Linguistic Tools: What, Why and HowValeria de Paiva
 
Using OpenNLP with Solr to improve search relevance and to extract named enti...
Using OpenNLP with Solr to improve search relevance and to extract named enti...Using OpenNLP with Solr to improve search relevance and to extract named enti...
Using OpenNLP with Solr to improve search relevance and to extract named enti...Steve Rowe
 

Similar to Bay Area NLP Reading Group - 7.12.16 (20)

Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
 
Webinar: OpenNLP and Solr for Superior Relevance
Webinar: OpenNLP and Solr for Superior RelevanceWebinar: OpenNLP and Solr for Superior Relevance
Webinar: OpenNLP and Solr for Superior Relevance
 
NLP todo
NLP todoNLP todo
NLP todo
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
NLP
NLPNLP
NLP
 
NLP
NLPNLP
NLP
 
Content Processing Architecture and Applications - Introduction to Text Mining
Content Processing Architecture and Applications - Introduction to Text MiningContent Processing Architecture and Applications - Introduction to Text Mining
Content Processing Architecture and Applications - Introduction to Text Mining
 
Challenges in transfer learning in nlp
Challenges in transfer learning in nlpChallenges in transfer learning in nlp
Challenges in transfer learning in nlp
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
A Corpus-based Approach to Tracking L2 Development
A Corpus-based Approach to Tracking L2 DevelopmentA Corpus-based Approach to Tracking L2 Development
A Corpus-based Approach to Tracking L2 Development
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked Data
 
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf EremyanDataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
 
NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflow
 
Natural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxNatural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptx
 
NLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful inNLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful in
 
lect36-tasks.ppt
lect36-tasks.pptlect36-tasks.ppt
lect36-tasks.ppt
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language Processing
 
Using construction grammar in conversational systems
Using construction grammar in conversational systemsUsing construction grammar in conversational systems
Using construction grammar in conversational systems
 
Portuguese Linguistic Tools: What, Why and How
Portuguese Linguistic Tools: What, Why and HowPortuguese Linguistic Tools: What, Why and How
Portuguese Linguistic Tools: What, Why and How
 
Using OpenNLP with Solr to improve search relevance and to extract named enti...
Using OpenNLP with Solr to improve search relevance and to extract named enti...Using OpenNLP with Solr to improve search relevance and to extract named enti...
Using OpenNLP with Solr to improve search relevance and to extract named enti...
 

Recently uploaded

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 

Recently uploaded (20)

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health Classification
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 

Bay Area NLP Reading Group - 7.12.16

  • 1. Bay Area NLP Reading Group July 12, 2016
  • 2. Announcements Join our Slack channel! https://bay-area-nlp-reading.slack.com/ To join, message me (Katie Bauer) on Meetup, talk to me after the meeting or email bay.area.nlp.reading.group@gmail.com
  • 3. Want to help out? Present a paper you love Demo your favorite NLP tool or library Host a future meetup Participate!
  • 4. What is NER? Extracting proper nouns and classifying into categories - Universally: person, location, organization - Date/time, currencies, domain-specific Traditional Approaches: - gazetteers (list lookup) - shallow parsing - ‘based in San Francisco’ Difficulties: - Reconciling different versions of names - Noam Chomsky vs. Professor Chomsky - Washington - person, place, collective name for US government - May - person or month?
  • 5. What are Convolutional Neural Nets? 1. Divide input into windows 2. Calculate some sort of summary 3. Feed that summary to next layer 4. Divide summary into windows 5. Summarize the summary And so on and so forth
  • 6. What does that look like for language? Windows are word contexts If wi = ‘movie’, [wi-2 , wi-1 , wi , wi+1 , wi+2 ] = [like, this, movie, very, much] Wi is a column vector
  • 7. Model Task: Given a sentence, score the likelihood of each named entity class word for each word Input: Sentence of N words {w1 ,w2 , … , wn-1 , wn } Words wn = [wwrd ,wwch ]
  • 8. Model Scoring Concatenate all word vectors centered around word n to get vector r Pass r through two layers of the neural network Check transition score Aut to see likelihood of tags given previous tags Store all possible tag sequences Pick most likely sequence at end of sentence Optimization Sentence score is conditional probability, so minimize negative log likelihood Backpropogated stochastic gradient descent
  • 9. Corpora Portuguese - Word embeddings initialized with three corpora - Trained and tested on HAREM - HAREM 1 for training, miniHAREM for test Spanish - Word embeddings initialized with Spanish Wikipedia - Trained and tested on SPA CoNLL-2002 - SPA CoNLL-2002 has predivided training, development and test sets
  • 10. Experiments Comparable Architectures: - CharWNN - WNN - CharNN - WNN + capitalization feature + suffix feature
  • 11. Experiments State of the Art: - AdaBoost for Spanish - ETLCMT for Portuguese
  • 13. Experiments Pretrained word embeddings vs. randomly initialized word embeddings
  • 14. Takeaways Different types of information are captured at word and character level Prior knowledge (pretrained word embeddings) improves performance With no prior knowledge, a bigger data set is better
  • 15. Additional Resources Introduction to Named Entity Recognition https://gate.ac.uk/sale/talks/stupidpoint/diana-fb.ppt Understanding Convolutional Neural Networks for NLP http://www.wildml.com/2015/11/understanding-convolutional-neural-networks-for-nlp/ Implementing a CNN for Text Classification in Tensorflow http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/
  • 16. Thank you! Bay Area NLP Reading Group July 12, 2016