[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction

Director of Engineering at Basis Technology
Apr. 17, 2020
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction
1 of 28

More Related Content

Similar to [Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction

Cognitive Load Theory for PPT Cognitive Load Theory for PPT
Cognitive Load Theory for PPT Swagateeka Panigrahy
The answer is bThe answer is b
The answer is bMartin Bailey
Hiring & Onboarding in Turbulent Times FTW - Interaction23.pdfHiring & Onboarding in Turbulent Times FTW - Interaction23.pdf
Hiring & Onboarding in Turbulent Times FTW - Interaction23.pdfRuss U
Owning Your CommunicationsOwning Your Communications
Owning Your Communicationsjcravens
ORA WorkshopORA Workshop
ORA WorkshopMarshall Karp
StangaOne1_A_Guide_for_EveryoneStangaOne1_A_Guide_for_Everyone
StangaOne1_A_Guide_for_EveryoneBlagovesta Hristova

More from Gil Irizarry

A Rose By Any Other Name.pdfA Rose By Any Other Name.pdf
A Rose By Any Other Name.pdfGil Irizarry
Ai for Good: Bad Guys, Messy Data, & NLPAi for Good: Bad Guys, Messy Data, & NLP
Ai for Good: Bad Guys, Messy Data, & NLPGil Irizarry
DevSecOps Orchestration of Text Analytics with ContainersDevSecOps Orchestration of Text Analytics with Containers
DevSecOps Orchestration of Text Analytics with ContainersGil Irizarry
Towards Identity Resolution: The Challenge of Name MatchingTowards Identity Resolution: The Challenge of Name Matching
Towards Identity Resolution: The Challenge of Name MatchingGil Irizarry
RapidMiner - Don’t Forget to Pack Text Analytics on Your Data Exploration Jou...RapidMiner - Don’t Forget to Pack Text Analytics on Your Data Exploration Jou...
RapidMiner - Don’t Forget to Pack Text Analytics on Your Data Exploration Jou...Gil Irizarry
Beginning Native Android AppsBeginning Native Android Apps
Beginning Native Android AppsGil Irizarry

Recently uploaded

Wellbeing of Wales 2023Wellbeing of Wales 2023
Wellbeing of Wales 2023Statistics for Wales @ Welsh Government
Cybersecurity Awareness Overview.pptxCybersecurity Awareness Overview.pptx
Cybersecurity Awareness Overview.pptxAfsanaMumal2
Random Forests without the Randomness Sept_2023.pptxRandom Forests without the Randomness Sept_2023.pptx
Random Forests without the Randomness Sept_2023.pptxKirkMonteverde1
Interpreting the brief B2.pptxInterpreting the brief B2.pptx
Interpreting the brief B2.pptxStephen266013
research report.pdfresearch report.pdf
research report.pdfMuthuLakshmi124949
BLOCK CHAIN TECHNOLOGY.pptxBLOCK CHAIN TECHNOLOGY.pptx
BLOCK CHAIN TECHNOLOGY.pptxPriyanka749523

[Apple|organization] and [oranges|fruit]: How to evaluate NLP tools for entity extraction

Editor's Notes

  1. Rosette is a full NLP stack from language identification to morphology to entity extraction and resolution
  2. One tool will output 5 levels of sentiment and another only 3. One tool will output transitive vs. intransitive verbs and another will output only verbs. One will strip possessives (King’s Landing) and another won’t.
  3. Finding data is easier but annotating data is hard
  4. The Ukraine is now Ukraine, similarly Sudan. How do you handle the change over time?
  5. Screenshot of the TOC of our Annotation Guidelines. 42 pages. In some meetings, it’s the only doc under NDA. Header says for all. That means for all languages. We also have specific guidelines for some languages.
  6. Images from wikipedia
  7. Images from wikipedia
  8. A harmonic mean is a better balance of two values than a simple average
  9. Increasing A would lower the overall score, since both G and H would get smaller
  10. Changing the beta value allows you to tune the harmonic mean and weight either precision or recall more heavily
  11. https://link.springer.com/referenceworkentry/10.1007%2F978-0-387-39940-9_482 Precision is a single value. Average precision takes into account precision over a range of results. Mean average precision is the mean over a range of queries.
  12. http://www.davidsbatista.net/blog/2018/05/09/Named_Entity_Evaluation/ https://pdfs.semanticscholar.org/f898/e821bbf4157d857dc512a85f49610638f1aa.pdf
  13. Annotated sample of people names. Note “Cook’s” and “Powell” as references to earlier names. Note the “Emerson Collective” as an organization name is not highlighted.
  14. Precision = TP / (TP + FP), Recall = TP / (TP + FN) , F = 2*((P * R)/(P + R))
  15. AP = (sum of (True Positive / Predicted Positive)) / num of True Positive MAP = is the mean of AP over a range of different queries, for example varying the tolerances or confidences
  16. https://pdfs.semanticscholar.org/f898/e821bbf4157d857dc512a85f49610638f1aa.pdf http://www.davidsbatista.net/blog/2018/05/09/Named_Entity_Evaluation/
  17. Possible: The number of entities hand-annotated in the gold evaluation corpus, equal to (Correct + Incorrect + Partial + Missing) Actual: The number of entities tagged by the test NER system, equal to (Correct + Incorrect + Partial + Spurious) (R) Recall = (correct + (1/2 partial)) / possible (P) Precision = (correct + (1/2 partial)) / actual F =(2 * P * R) / (P + R)
  18. http://www.real-statistics.com/reliability/interrater-reliability/cohens-kappa/