yourHistory - entity linking for a personalized timeline of historic events
1. Gaza War
Britches
World War II
Berlin Wall
Woodstock
1950
1900
1910
1970
1920
9/11
Gulf War
1930
1980
1940
1950
1990
1960
BET Hiphop Awards
2000
1970
1980
2010
1990
2000
David Graus, Maria-Hendrike Peetz,
Daan Odijk, Maarten de Rijke, Ork de Rooij
2010
2. Entity Linking for a personalized timeline of historic events
•
Motivation
•
Method
•
•
Part II: Generate User Profile
•
Part III: Matching Events to User Profile
•
•
Part I: Fetch Candidate Historic Events
Part IV: Scoring & Ranking Events
Future Work
3. •
[…] To design and build innovative and robust prototypes and
demos for tools that analyse and/or integrate open web data for
educational purposes.
21. Access Facebook profile
MY FACEBOOK
PROFILE
BIO
POST
POST
LIKES
POST
{
"id":
"1183880085",
"likes":
{
"data":
[
{
"category":
"Musician/band",
"created_time":
"2013-‐10-‐27T11:37:51+0
"name":
"NAS",
"id":
"113591595350795"
},
{
"category":
"Company",
"created_time":
"2013-‐10-‐17T07:45:36+0
"name":
"Infinibase",
"id":
"573216229380347"
},
{
"category":
"Magazine",
"created_time":
"2013-‐10-‐04T13:55:10+0
"name":
"New
Scientist
NL",
"id":
"369158433181445"
},
22. Extract text
attributes
•
•
•
•
•
•
{
"id":
"1183880085",
"likes":
{
"data":
[
{
"category":
"Musician/band",
"created_time":
"2013-‐10-‐27T11:37:51+0000",
"name":
"NAS",
"id":
"113591595350795"
},
{
"category":
"Company",
"created_time":
"2013-‐10-‐17T07:45:36+0000",
"name":
"Infinibase",
"id":
"573216229380347"
},
{
"category":
"Magazine",
"created_time":
"2013-‐10-‐04T13:55:10+0000",
"name":
"New
Scientist
NL",
"id":
"369158433181445"
},
{
"category":
"Tv
show",
"created_time":
"2010-‐05-‐09T01:06:27+0000",
"name":
"The
Wire",
"id":
"5991693871"
}
]
}
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Story
Omroep
Maxim
Gamer01
Breaking
Bad
AT5
Mad
Men
The
Wire
Monty
Python's
Flying
Circus
Flight
of
the
Conchords
Donnie
Darko
Flevopark
Film
Festival
Do
The
Right
Thing
A
Clockwork
Orange
Wild
Style
Princess
Mononoke
The
Fountain
Pi
Northfork
La
Haine
Zen
and
the
Art
of
Motorcycle
Maintenance
Moon
Palace
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Fountainhead
The
Wind-‐Up
Bird
Chronicle
Wu-‐Tang
J.Cole
NAS
Pusha
T
ASAP
Rocky
Ab-‐Soul
Chance
The
Rapper
Cannibal
Ox
Bonobo
Aesop
Rock
Boards
Of
Canada
Jurassic
5
GREMS
Quasimoto
Strange
Journey
Volume
Three
Drop
Velvet
MODESELEKTOR
IAM
Derek
The
Onion
Imgur
De
Speld
Wu-‐Tang
23. •
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
ASAP
Rocky
Ab-‐Soul
Chance
The
Rapper
Cannibal
Ox
Bonobo
Aesop
Rock
Boards
Of
Canada
Jurassic
5
GREMS
Quasimoto
Strange
Journey
Volume
Three
Drop
Velvet
MODESELEKTOR
IAM
Derek
The
Onion
Imgur
De
Speld
Wu-‐Tang
J.Cole
I
Am
Fucking
Ambivalent
About
Science
NAS
Pusha
T
ASAP
Rocky
Chrietitie
Infinibase
Marktplaatspoxc3xabzie
Jeannette
Span
:
Spelen
24. Entity Linking
•
Given a Knowledge Base
•
Link mentions of entities (or concepts) to their referent entities
25. Entity Linking
•
From Wikipedia:
•
Extract anchor texts (words used to link to Wikipedia pages)
!
!
!
!
!
!
•
For each n-gram n ↔ Wikipedia page W estimate:
•
Probability of using n-gram n to refer to Wikipedia page W
26. Entity Linking Example
Link Probability
“Nas” occurs 2475x in Wikipedia
!
is anchor
1.723x
is no anchor
752x
27. Entity Linking Example
Link Probability
“Nas” occurs 2475x in Wikipedia
!
is anchor
1723/2475
=
69,6%
is no anchor
752/2475
=
30.4%
28. Entity Linking Example
Commonness
•
Nas is used to refer to:
•
http://en.wikipedia.org/wiki/Nas
•
http://en.wikipedia.org/wiki/Naas
•
http://en.wikipedia.org/wiki/Nås
•
http://en.wikipedia.org/wiki/Nas (Ikaria)
•
http://en.wikipedia.org/wiki/Untitled Nas album
29. Entity Linking Example
Commonness
•
Nas is used to refer to:
•
http://en.wikipedia.org/wiki/Nas
14x
•
http://en.wikipedia.org/wiki/Naas
4x
•
http://en.wikipedia.org/wiki/Nås
3x
•
http://en.wikipedia.org/wiki/Nas (Ikaria)
2x
•
http://en.wikipedia.org/wiki/Untitled Nas album
2x
30. Entity Linking Example
Commonness
•
Nas is used to refer to:
•
http://en.wikipedia.org/wiki/Nas
14/25 =
56%
•
http://en.wikipedia.org/wiki/Naas
4/25 =
1.6%
•
http://en.wikipedia.org/wiki/Nås
3/25 =
1.2%
•
http://en.wikipedia.org/wiki/Nas (Ikaria)
2/25 =
0.8%
•
http://en.wikipedia.org/wiki/Untitled Nas album
2/25 =
0.8%
32. •
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
AT5
Mad
Men
The
Wire
Monty
Python's
Flying
Circus
Flight
of
the
Conchords
Donnie
Darko
Flevopark
Film
Festival
Do
The
Right
Thing
A
Clockwork
Orange
Wild
Style
Princess
Mononoke
The
Fountain
Pi
Northfork
La
Haine
Zen
and
the
Art
of
Motorcycle
Maintenance
Moon
Palace
The
Fountainhead
The
Wind-‐Up
Bird
Chronicle
Wu-‐Tang
J.Cole
53. Future Work
•
Log interactions
•
Interpret clicks as (implicit) feedback:
•
Click on Event: user is interested
•
No click on Event: user is not
•
Learn scoring & ranking functions
54. Thank you! Questions?
Try yourHistory:
See our poster:
http://apps.facebook.com/yourHistory
#98
!
!
!
!
David Graus
d.p.graus@uva.nl
@dvdgrs