SlideShare a Scribd company logo
1 of 14
Ben Bowles
Insight Data Science Demo
Palo Alto, CA
EntitizeThis
Named Entity Recognition
Ben Bowles
Insight Data Science Demo
Palo Alto, CA
EntitizeThis
Named Entity Recognition
Slides moving
Ben Bowles
Insight Data Science Demo
Palo Alto, CA
EntitizeThis
Named Entity Recognition
Ben Bowles
Insight Data Science Demo
Palo Alto, CA
EntitizeThis
Named Entity Recognition
Organization
Person
Location
• Develop an accurate NER
classifier, and demonstrate its
utility for search
Goal
Algorithm and Training Data
• Conditional Random Fields classifier
trained on word properties
Example Features
-uppercase?
-digit?
-title?
…
Wikipedia Training Set (143K sentences)
Elastic Search with NER
query
[named_entity]
document
[named_entity]
Webtexts +
Titles
Sentence parsing,
named entity
recognition
Classification Report for Stanford classifier
Classification Report for CRF classifier
-Performance similar
for both classifiers on
separate validation set
of 1600 sentences
entitizethis.info
github.com/bowlesbe/entitizethis
screenshot_05
screenshot_05
screenshot_05screenshot_06
NER predictions by category
-F1 scores:
0.69 vs 0.71
Improvement for
misc entities
Classifier can
easily be built
upon
ROC CURVE
HITS
FALSE ALARMS
Stanford
NER classifier
Wiki-gold
(manually
annotated
validation
set)
score f1
Stanford
NER classifier
Wiki-gold
(manually
annotated
validation
set)
score f1
My CRF
Classifier
Presentation
Presentation

More Related Content

Similar to Presentation

Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...Fwdays
 
Building Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDSBuilding Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDSSujit Pal
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine
Leveraging Lucene/Solr as a Knowledge Graph and Intent EngineLeveraging Lucene/Solr as a Knowledge Graph and Intent Engine
Leveraging Lucene/Solr as a Knowledge Graph and Intent EngineTrey Grainger
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemTrey Grainger
 
Learning Better Context Characterizations: An Intelligent Information Retriev...
Learning Better Context Characterizations: An Intelligent Information Retriev...Learning Better Context Characterizations: An Intelligent Information Retriev...
Learning Better Context Characterizations: An Intelligent Information Retriev...Carlos Lorenzetti
 
Distilling insights @ AppsFlyer
Distilling insights @ AppsFlyerDistilling insights @ AppsFlyer
Distilling insights @ AppsFlyerArnon Rotem-Gal-Oz
 
Enterprise Search - Apache Solr in Magento Enterprise at Magento Live DE
Enterprise Search - Apache Solr in Magento Enterprise at Magento Live DEEnterprise Search - Apache Solr in Magento Enterprise at Magento Live DE
Enterprise Search - Apache Solr in Magento Enterprise at Magento Live DETobias Zander
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Lucidworks
 
Named Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationNamed Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationRichard Littauer
 
SVC101 Building Search into Your App - AWS re: Invent 2012
SVC101 Building Search into Your App - AWS re: Invent 2012SVC101 Building Search into Your App - AWS re: Invent 2012
SVC101 Building Search into Your App - AWS re: Invent 2012Amazon Web Services
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsPaul Hofmann
 
Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD  Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD RIILP
 
RCOMM 2011 - Sentiment Classification
RCOMM 2011 - Sentiment ClassificationRCOMM 2011 - Sentiment Classification
RCOMM 2011 - Sentiment Classificationbohanairl
 
RCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMinerRCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMinerbohanairl
 
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge BasesEvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge BasesSebastian Tramp
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataAndre Freitas
 
Hyperion Essbase Course Curriculum
Hyperion Essbase Course Curriculum Hyperion Essbase Course Curriculum
Hyperion Essbase Course Curriculum Faculties Online
 
Cassandra advanced data modeling
Cassandra advanced data modelingCassandra advanced data modeling
Cassandra advanced data modelingRomain Hardouin
 

Similar to Presentation (20)

Nikhil CV
Nikhil CVNikhil CV
Nikhil CV
 
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...
 
Building Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDSBuilding Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDS
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine
Leveraging Lucene/Solr as a Knowledge Graph and Intent EngineLeveraging Lucene/Solr as a Knowledge Graph and Intent Engine
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data system
 
Learning Better Context Characterizations: An Intelligent Information Retriev...
Learning Better Context Characterizations: An Intelligent Information Retriev...Learning Better Context Characterizations: An Intelligent Information Retriev...
Learning Better Context Characterizations: An Intelligent Information Retriev...
 
Distilling insights @ AppsFlyer
Distilling insights @ AppsFlyerDistilling insights @ AppsFlyer
Distilling insights @ AppsFlyer
 
Enterprise Search - Apache Solr in Magento Enterprise at Magento Live DE
Enterprise Search - Apache Solr in Magento Enterprise at Magento Live DEEnterprise Search - Apache Solr in Magento Enterprise at Magento Live DE
Enterprise Search - Apache Solr in Magento Enterprise at Magento Live DE
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
 
emnlp14v6.pptx
emnlp14v6.pptxemnlp14v6.pptx
emnlp14v6.pptx
 
Named Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationNamed Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 Presentation
 
SVC101 Building Search into Your App - AWS re: Invent 2012
SVC101 Building Search into Your App - AWS re: Invent 2012SVC101 Building Search into Your App - AWS re: Invent 2012
SVC101 Building Search into Your App - AWS re: Invent 2012
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & Statistics
 
Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD  Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD
 
RCOMM 2011 - Sentiment Classification
RCOMM 2011 - Sentiment ClassificationRCOMM 2011 - Sentiment Classification
RCOMM 2011 - Sentiment Classification
 
RCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMinerRCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMiner
 
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge BasesEvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked Data
 
Hyperion Essbase Course Curriculum
Hyperion Essbase Course Curriculum Hyperion Essbase Course Curriculum
Hyperion Essbase Course Curriculum
 
Cassandra advanced data modeling
Cassandra advanced data modelingCassandra advanced data modeling
Cassandra advanced data modeling
 

Presentation

Editor's Notes

  1. -spin it in temrs of ad targeting -providing a platform for mobile developers to have advertisements to link directly to pertinent content within other apps -instead of just going to an app entrance, or to a webpage, they go directly to the right place in another app. -An example of this could be the exact page in living social where a relevant deal is stores. -A critical part of their business platform is the ability to serarch within other apps -interesting in improving their search algorithm by incorporating
  2. -spin it in temrs of ad targeting -providing a platform for mobile developers to have advertisements to link directly to pertinent content within other apps -instead of just going to an app entrance, or to a webpage, they go directly to the right place in another app. -An example of this could be the exact page in living social where a relevant deal is stores. -A critical part of their business platform is the ability to serarch within other apps -interesting in improving their search algorithm by incorporating
  3. -spin it in temrs of ad targeting -providing a platform for mobile developers to have advertisements to link directly to pertinent content within other apps -instead of just going to an app entrance, or to a webpage, they go directly to the right place in another app. -An example of this could be the exact page in living social where a relevant deal is stores. -A critical part of their business platform is the ability to serarch within other apps -interesting in improving their search algorithm by incorporating
  4. -spin it in temrs of ad targeting -providing a platform for mobile developers to have advertisements to link directly to pertinent content within other apps -instead of just going to an app entrance, or to a webpage, they go directly to the right place in another app. -An example of this could be the exact page in living social where a relevant deal is stores. -A critical part of their business platform is the ability to serarch within other apps -interesting in improving their search algorithm by incorporating