Knowledge extraction and
semantic linking in the
Encyclopedia of Life
Anne Thessen
annethessen@gmail.com
Performance Metrics
• 1631 URIs assigned to 487 text objects from 21 test
species
• 83% were correct
• 20% of the text obj...
Challenges and Errors
• Many ways to say the same thing
– Uterine cannibalism = oophagy
• Negation (9%)
• Describing relat...
Finding Taxonomic Names
Challenges
Koko
Горилла
Guerilla
Eastern Lowland Gorilla
Gorilla
graueri
Gorilla berengei
Gorilla beringei
MatschieGorilla...
Challenges
Contextual data
Primate
Monkey
Eyes
Food
Panama
Aotus nancymaae
Disambiguate by
authority, species, conte
xtual...
Knowledge Extraction and Semantic Linking in the Encyclopedia of Life
Knowledge Extraction and Semantic Linking in the Encyclopedia of Life
Knowledge Extraction and Semantic Linking in the Encyclopedia of Life
Knowledge Extraction and Semantic Linking in the Encyclopedia of Life
Upcoming SlideShare
Loading in …5
×

Knowledge Extraction and Semantic Linking in the Encyclopedia of Life

307 views
230 views

Published on

This talk describes work I have been doing with Cyndy Parr at EOL to annotate text with DBpedia URIs and to generate a species associations network. It was presented to the Boston Python User Group in April 2013.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
307
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
2
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Knowledge Extraction and Semantic Linking in the Encyclopedia of Life

  1. 1. Knowledge extraction and semantic linking in the Encyclopedia of Life Anne Thessen annethessen@gmail.com
  2. 2. Performance Metrics • 1631 URIs assigned to 487 text objects from 21 test species • 83% were correct • 20% of the text objects were not assigned a URI • 239 keys in the dictionary • Precision 0.89, Recall 1, F1 Score 0.942
  3. 3. Challenges and Errors • Many ways to say the same thing – Uterine cannibalism = oophagy • Negation (9%) • Describing related taxa (30%) • Word/phrase part (27%) • Generalities (15%) • Homonym (13%)
  4. 4. Finding Taxonomic Names
  5. 5. Challenges Koko Горилла Guerilla Eastern Lowland Gorilla Gorilla graueri Gorilla berengei Gorilla beringei MatschieGorilla beringei mikenensis King kong Gorilla gorilla Virunga Gorila Gorille Mountain gorilla 大猩猩 ゴリラ
  6. 6. Challenges Contextual data Primate Monkey Eyes Food Panama Aotus nancymaae Disambiguate by authority, species, conte xtual data Contextual data Legume Plant Flower Mirbeliea Australia Aotus mollis

×