The document describes an entity linking approach to generate a personalized timeline of historic events for a user. It involves four main parts: (1) fetching candidate historic events from DBpedia, (2) generating a user profile from information extracted from the user's Facebook profile, (3) matching the candidate events to the interests in the user's profile, and (4) scoring and ranking the events to produce the final personalized timeline. The approach uses entity linking techniques to associate mentions of entities in the user's profile with the corresponding entries in a knowledge base, in order to identify the user's interests.
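The matching, scoring, and ranking steps can be sketched as a simple overlap-based ranker (a minimal illustration with hypothetical names; the actual system's features and scoring are more involved):

```python
from dataclasses import dataclass


@dataclass
class Event:
    title: str
    year: int
    entities: set  # KB entities the event mentions


def score_event(event, interests):
    # Jaccard-style overlap between the event's entities and the
    # user's linked interests.
    if not event.entities:
        return 0.0
    return len(event.entities & interests) / len(event.entities | interests)


def personalized_timeline(events, interests, k=10):
    # Rank candidate events by interest overlap, then order the
    # top-k chronologically to form the timeline.
    ranked = sorted(events, key=lambda e: score_event(e, interests), reverse=True)
    return sorted(ranked[:k], key=lambda e: e.year)
```

Events with no entity overlap fall to the bottom; the top-k cut keeps the timeline focused on the user's interests.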
This document discusses understanding email traffic patterns through recipient recommendation. It explores using social network analysis and language models of email content to predict likely recipients of a given email. Specifically, it examines using measures of node importance in the network, strength of connections between nodes, and similarity between language models of communication profiles to rank and select recipient nodes. The findings indicate that combining social network analysis and language modeling performs better than either approach individually, and that language model similarity is most important for interpersonal communication, while network metrics are more informative for highly active users. Recipient recommendation could help with applications like anomaly detection in e-discovery.
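The combination of network strength and language-model similarity can be sketched as a linear mixture (hypothetical names and a simplified cosine similarity; the original work's models are richer):

```python
import math


def lm_similarity(profile_a, profile_b):
    # Cosine similarity between two unigram term-frequency
    # profiles (dicts of word -> count).
    dot = sum(profile_a.get(w, 0) * c for w, c in profile_b.items())
    na = math.sqrt(sum(c * c for c in profile_a.values()))
    nb = math.sqrt(sum(c * c for c in profile_b.values()))
    return dot / (na * nb) if na and nb else 0.0


def rank_recipients(email_terms, candidates, alpha=0.5):
    # candidates: {name: (connection_strength, term_profile)}.
    # Mix normalized connection strength with language-model
    # similarity using weight alpha.
    max_strength = max(s for s, _ in candidates.values()) or 1
    scored = {
        name: alpha * (s / max_strength)
              + (1 - alpha) * lm_similarity(email_terms, prof)
        for name, (s, prof) in candidates.items()
    }
    return sorted(scored, key=scored.get, reverse=True)
```

Setting alpha per sender would reflect the finding above: lower alpha (more language-model weight) for interpersonal communication, higher alpha for highly active users.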
Generating Pseudo-ground Truth for Detecting New Concepts in Social Streams – David Graus
The manual curation of knowledge bases is a bottleneck in fast-paced domains where new concepts constantly emerge. Identification of nascent concepts is important for improving early entity linking, content interpretation, and recommendation of new content in real-time applications. We present an unsupervised method for generating pseudo-ground truth for training a named entity recognizer to specifically identify entities that will become concepts in a knowledge base, in the setting of social streams. We show that our method is able to deal with missing labels, justifying the use of pseudo-ground truth generation for this task. Finally, we show how our method significantly outperforms a lexical-matching baseline by leveraging strategies for sampling pseudo-ground truth based on entity confidence scores and the textual quality of input documents.
This document discusses adding semantic structure to real-time social data from Twitter through Twitter Annotations. It describes how Annotations can be mapped to existing Semantic Web vocabularies and linked to datasets to enable real-time semantic search over social and linked data. A system called TwitLogic is presented that captures Twitter data, converts it to RDF, and publishes it as linked streams to allow for continuous querying and integration with the live Semantic Web.
The Gulf Tower Project aims to use data from Instagram photos to determine the mood of Pittsburgh and display it through light colors on the Gulf Tower. Photos are analyzed using sentiment analysis to assign scores and categorize them as positive, negative, or neutral. Over 16,000 photos were collected, with most being positive. The colors mapped to emotions will light up the Gulf Tower to visually show Pittsburgh's mood. Challenges included API limits, technical issues, and ensuring the story was understandable. The goal is to use technology to represent community feelings from social media data in an artistic display.
This document provides an overview of the evolution of information and technology over time. It begins with ancient symbols and manuscripts, then discusses the development of the telegraph, telephone, radio, and early computers. It outlines the creation of the internet and the world wide web, and how they led to an explosion of information sharing. The document discusses challenges of information overload and different search technologies and services that have been developed to help users find relevant information. It promotes the use of the BOSS API and other tools to build custom search applications and solutions.
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via – OpenSource Connections
The New York Times has had search for a long time but 2018 was the year in which the company engaged with relevance in a deep way. The aim of this talk is to share what we've learned as we've increased our search sophistication and some of the challenges we still face.
Some of the techniques we've adopted in this past year include offline metrics testing, reflective testing, and user engagement metrics. We now have a process in place to quickly get mappings changes out to production. As a team we now also have a vocabulary for talking about relevance and can use it to discuss trade-offs and goals in conjunction with our metrics.
We hope this talk is of use to those who've put off working on search relevance due to fear, uncertainty, or ambivalence. We will talk about how we went from working on everything but search relevance to finally pulling back the curtain on the search system. We hope what we've learned can help others get started.
ServerSide Javascript on Freebase - SF JavaScript meetup #9 – Will Moffat
This document summarizes a presentation about Acre, which allows running server-side JavaScript on Freebase.com. It introduces Freebase as a topic database containing over 11 million topics. It describes MQL, the JSON query language used to query Freebase. Examples are provided to find Russian cosmonauts and tropical storms from the 1990s. The presentation also discusses hosting apps on FreebaseApps.com and using Acre's templating language and widgets like suggestion.
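MQL's query-by-example style can be illustrated with a small sketch (the type path below is illustrative and not verified against the historical Freebase schema):

```python
import json

# An MQL query is a JSON skeleton: null slots ask the service
# to fill in values, while concrete values act as constraints.
query = [{
    "type": "/spaceflight/astronaut",  # illustrative type path
    "name": None,                      # null -> "return this"
    "limit": 5,
}]

# The query is wrapped in a JSON envelope before being sent.
payload = json.dumps({"query": query})
```

The same skeleton pattern extends to nested objects, so queries can follow links between topics without any joins.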
Hacking the Newsroom
This is the slide deck for a presentation I gave this year at FlashBelt, Flash on the Beach, and Flash on Tap. Despite the names of these conferences, the presentation has nothing to do with Flash.
Here is the session description:
"In February, the New York Times announced that it was giving away the keys to 28 years of data - news stories, movie reviews, obituaries, and political statistics - all for free. Whether the dying gasp of a legendary institution, or the beginnings of an extraordinary rebirth, the release of this vast and historically significant information is a boon to data visualizers, entrepreneurs, social scientists and artists around the world.
In this session, Jer will show a variety of work that he has produced using data from The New York Times and The Guardian newspapers. He'll show how to access this information easily in Flash and Processing, and will share code samples to get you started in explorations of your own. Along the way, he'll attempt to examine how a new era of open data is affecting science, art, and design."
The document summarizes a presentation on linking Civil War data using Linked Open Data techniques. It discusses:
1. The potential for mashups and remixing of cultural heritage data in a linked data context.
2. The growth of the Linked Open Data cloud and importance of linking data from libraries, archives, and museums.
3. The Civil War Data 150 project which aims to link datasets about the American Civil War using a common vocabulary to enable new analyses and visualizations of the data.
This presentation was given by Georges Oates (Flickr) at the seminar Nationaal Archief joins Flickr the Commons on 4 November 2008 in Rotterdam. This project is part of the Dutch digitization project Images for the Future, www.imagesforthefuture.org.
The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm... – MODUL Technology GmbH
The document describes The NoTube BeanCounter, which aggregates user data for television programme recommendation. It aligns and enriches various data sources like EPG data, viewer logs, social media profiles, and metadata from sources like IMDB to build user profiles. These profiles are then used by the BeanCounter to provide personalized recommendations of television programmes and series to users based on their preferences and those of their social connections. A prototype recommendation system called iZapper was also developed.
Data Science - The Most Profitable Movie Characteristic – Cheah Eng Soon
The document analyzes movie data from the TMDB 5000 Movie Dataset to understand characteristics of profitable movies. It explores relationships between genres, movie types and profits over time. The data is cleaned by merging movie and credits datasets, selecting relevant columns, and filling in missing values for release date and runtime. Three main issues are studied: how genres change over time, the relationship between type and profit, and comparisons between production companies.
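The cleaning and profit-analysis steps described above can be sketched with the standard library (the original analysis presumably uses pandas; the rows and numbers here are made up):

```python
from statistics import median

# Hypothetical rows mirroring a few TMDB-style columns.
movies = [
    {"title": "A", "genres": ["Action"], "budget": 100, "revenue": 350, "runtime": 110},
    {"title": "B", "genres": ["Drama"],  "budget": 40,  "revenue": 60,  "runtime": None},
    {"title": "C", "genres": ["Action"], "budget": 80,  "revenue": 50,  "runtime": 95},
]

# Fill missing runtimes with the median of the known values,
# mirroring the missing-value step described above.
known = [m["runtime"] for m in movies if m["runtime"] is not None]
for m in movies:
    if m["runtime"] is None:
        m["runtime"] = median(known)

# Profit per movie, then mean profit per genre.
profit_by_genre = {}
for m in movies:
    profit = m["revenue"] - m["budget"]
    for g in m["genres"]:
        profit_by_genre.setdefault(g, []).append(profit)

avg_profit = {g: sum(p) / len(p) for g, p in profit_by_genre.items()}
```

Grouping by release year instead of genre gives the "genres over time" view in the same few lines.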
This document provides a brief history of the development of computer networks from 1945 to present day. It mentions early pioneers like Vannevar Bush who envisioned associative networks of information called memex in 1945. Ted Nelson coined the term "hypertext" in 1965 to describe interconnected multimedia. The small world experiment in 1967 and Tim Berners-Lee's invention of the World Wide Web in 1991 were important early network developments. More recent research includes scale-free networks and crowdsourcing as networks have grown exponentially with user generated content and open collaboration online.
This document discusses how maps can be used to enrich math tasks. It provides examples of using maps to measure the height of structures by comparing shadows, calculating speed from distance and time measurements in a "speed trap" activity, and exploring zoom levels and dilation factors in map images. Other tips mentioned include finding creative reference sources and measuring objects in multiple ways to improve accuracy. Potential map websites for similar activities are also listed.
R, Data Wrangling & Kaggle Data Science Competitions – Krishna Sankar
Presentation for my tutorial at Big Data Tech Con http://goo.gl/ZRoFHi
This is the R version of my pycon tutorial + a few updates
It is a work in progress; I will update it with daily snapshots until it is done.
GRASS GIS, Star Trek and old Video Tape – a reference case on audiovisual pre... – Peter Löwe
This presentation showcases new options for the preservation of audiovisual content in the OSGeo communities beyond the established software repositories or YouTube. Audiovisual content related to OSGeo projects, such as training videos and screencasts, can be preserved by advanced multimedia archiving and retrieval services which are currently being developed by the library community. This is demonstrated by the reference case of a newly discovered high-resolution version of the GRASS GIS 1987 promotional video, which has been made available in the AV-Portal of the German National Library of Science and Technology (TIB). The portal allows for extended search capabilities based on enhanced metadata derived by automated video analysis. This is a reference case for future preservation activities regarding semantically enhanced Web 2.0 content from OSGeo projects.
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S... – Spark Summit
What if you could get the simplicity, convenience, interoperability, and storage niceties of an old-fashioned CSV with the speed of a NoSQL database and the storage requirements of a gzipped file? Enter Parquet.
At The Weather Company, Parquet files are a quietly awesome and deeply integral part of our Spark-driven analytics workflow. Using Spark + Parquet, we’ve built a blazing fast, storage-efficient, query-efficient data lake and a suite of tools to accompany it.
We will give a technical overview of how Parquet works and how recent improvements from Tungsten enable SparkSQL to take advantage of this design to provide fast queries by overcoming two major bottlenecks of distributed analytics: communication costs (IO bound) and data decoding (CPU bound).
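Part of what makes Parquet so compact is its columnar layout combined with encodings such as dictionary encoding; here is a toy pure-Python illustration of the idea (not the actual Parquet format):

```python
def dictionary_encode(column):
    # Parquet-style dictionary encoding: store each distinct
    # value once, plus a compact list of integer indexes.
    dictionary, indexes = [], []
    positions = {}
    for value in column:
        if value not in positions:
            positions[value] = len(dictionary)
            dictionary.append(value)
        indexes.append(positions[value])
    return dictionary, indexes


# Row-oriented data flattened into a single column, as a
# columnar store would lay it out on disk.
station = ["KATL", "KATL", "KJFK", "KATL", "KJFK"]
dictionary, indexes = dictionary_encode(station)
```

Low-cardinality columns like these collapse to a tiny dictionary plus small integers, which is also why columnar scans decode so cheaply on the CPU side.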
This document contains information from a Twitter engineering presentation about the Twitter API and core objects like users, timelines, tweets, and the social graph. It includes examples of user and tweet JSON structures, as well as screenshots and links to documentation, code samples, and visualizations related to analyzing tweets and trends on Twitter. The presentation encourages attendees to explore the Twitter API and contact Twitter engineers with any other questions.
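The tweet JSON structures mentioned above can be handled with a standard JSON parser; the payload below follows the shape of the classic REST API tweet object, trimmed to a few fields:

```python
import json

# A trimmed tweet object in the shape of the classic REST API
# payload (only a handful of its many fields).
raw = '''{
  "id": 123,
  "text": "Exploring the #TwitterAPI",
  "user": {"id": 42, "screen_name": "example_dev"},
  "entities": {"hashtags": [{"text": "TwitterAPI"}]}
}'''

tweet = json.loads(raw)
author = tweet["user"]["screen_name"]
tags = [h["text"] for h in tweet["entities"]["hashtags"]]
```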
Freebase - Semantic Technologies 2010 Code Camp – Jamie Taylor
Freebase is a socially managed, semantic database that provides a rich set of APIs for accessing a wide range of data about the world around us. Getting started with Freebase is quick and easy - there are no API keys and you can make up to 100k queries a day as long as you follow the Creative Commons Attribution license.
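A minimal sketch of issuing such a keyless query: the MQL read service took the query as a URL-encoded JSON parameter (Freebase has since been retired, so the historical endpoint below is an assumption and no longer resolves):

```python
import json
from urllib.parse import urlencode

# Historical MQL read endpoint (service now retired; shown only
# to illustrate how a request was constructed).
ENDPOINT = "https://api.freebase.com/api/service/mqlread"

query = {"query": {"id": "/en/the_beatles", "name": None, "type": []}}
url = ENDPOINT + "?" + urlencode({"query": json.dumps(query)})
```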
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor... – Ian Milligan
This was the second part of a joint presentation I did with Jimmy Lin (Maryland) at the "Web Archiving Collaboration: New Tools and Models" conference at Columbia University, New York NY on 4 June 2015.
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori... – Ian Milligan
This is the second part of a joint presentation I did with Jimmy Lin (University of Maryland) at the "Web Archiving Collaboration: New Tools and Models" conference at Columbia University, New York NY on 4 June 2015.
The document provides an overview of resources for researching animation at the library, including reference books, databases, journals, and websites. It discusses searching the catalog and databases for books and articles on animation history, techniques, studios, films, and animators. Specific animators, films, and resources mentioned include Norman McLaren, Blinkity Blank, the National Film Board of Canada, and Eadweard Muybridge's photographs of galloping horses. Evaluation criteria and sample relevant websites focusing on animation are also listed.
Micha L. Rieser: How GLAM can support Wikipedians – Beat Estermann
The document discusses how galleries, libraries, archives, and museums (GLAM) can support Wikipedians by providing images and information from their collections. It notes that GLAM institutions often restrict photography and require lengthy permission processes, creating barriers for Wikipedians. The document proposes solutions like GLAM uploading high-quality images under free licenses, communicating directly with Wikipedians, and designating staff as open knowledge experts.
Augmenting RDBMS with MongoDB for ecommerce – Steven Francia
Steve Francia, VP of Engineering at OpenSky, a NYC-based social commerce company, on how OpenSky augments its RDBMS with MongoDB to develop the next ecommerce platform.
OpenSky combines traditional SQL solutions with NoSQL to overcome the limitations of each, increase development speed, and scale quickly.
Pragmatic ethical and fair AI for data scientists – David Graus
1. David Graus presented on pragmatic and fair AI for recruitment and news recommendations.
2. He discussed how algorithms can unintentionally learn and reflect human biases around gender and race. However, AI may also help address these biases, such as through representational ranking in recruitment to achieve demographic parity.
3. Graus also explored using editorial values like diversity, dynamism and serendipity to guide news recommendations, and found their system could increase dynamism without loss of accuracy through constrained intervention.
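Representational re-ranking toward demographic parity, as mentioned in point 2, can be sketched with a greedy pass (a simplified illustration with hypothetical group quotas; the talk's actual method may differ):

```python
def representational_rerank(candidates, groups, quota):
    # Greedily re-rank a relevance-ordered candidate list so that,
    # at every prefix, each group's share stays near its quota
    # (a simple take on demographic-parity-style re-ranking).
    result, counts = [], {g: 0 for g in quota}
    remaining = list(candidates)
    while remaining:
        def deficit(c):
            # How far the candidate's group is below its target
            # share for the next prefix length.
            g = groups[c]
            target = quota[g] * (len(result) + 1)
            return target - counts[g]
        # Prefer the group most below target; break ties by the
        # original (relevance) order.
        best = max(remaining, key=lambda c: (deficit(c), -remaining.index(c)))
        remaining.remove(best)
        counts[groups[best]] += 1
        result.append(best)
    return result
```

On a relevance ranking of three "M" candidates followed by one "F" candidate with 50/50 quotas, the "F" candidate is promoted to the second slot while relative order within each group is preserved.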
Slidedeck of my lecture at SIKS Course "Advances in Information Retrieval"
Read more here: https://graus.nu/blog/bias-in-recommendations-lecture-siks-course-on-advances-in-ir/
Similar to yourHistory - entity linking for a personalized timeline of historic events
RecSys in the Media Industry: Relevance, Recency, Popularity, and Diversity – David Graus
The document summarizes research on recommender systems in the media industry. It discusses how FD Mediagroup uses recommender systems for their SMART Radio and SMART Journalism products. Key aspects of building a recommender system that FD focuses on include relevance, usefulness, and trust. Relevance is evaluated using metrics like NDCG, MAP, and R-Precision. Usefulness considers both algorithmic goals like diversity and business goals. Trust is evaluated based on whether users engage with the recommender system.
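NDCG, one of the relevance metrics named above, can be computed in a few lines (standard log2 discount; graded relevance labels are assumed):

```python
import math


def dcg(relevances):
    # Discounted cumulative gain with the standard log2 discount:
    # rank 1 divides by log2(2), rank 2 by log2(3), and so on.
    return sum(rel / math.log2(rank + 2)
               for rank, rel in enumerate(relevances))


def ndcg(ranked_relevances):
    # Normalize by the DCG of the ideal (sorted) ordering, so a
    # perfect ranking scores exactly 1.0.
    ideal_dcg = dcg(sorted(ranked_relevances, reverse=True))
    return dcg(ranked_relevances) / ideal_dcg if ideal_dcg else 0.0
```

A ranking that buries a highly relevant item scores strictly below 1.0, which is what makes the metric useful for comparing rankers offline.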
Zoeken, vinden, en aanbevelen: personalisatie vs. privacy – David Graus
Lecture given at the VOGIN-IP-lezing on 28 March 2018 at the Openbare Bibliotheek Amsterdam.
DISCLAIMER: this talk is a fine bit of old-fashioned (human) manipulation: an expert showing up with five or so recommendations :-).
"These days, technology companies that collect user behaviour data at scale are increasingly eyed with suspicion. In this talk I explain why leveraging user behaviour matters, and how it is used to unlock information effectively and make it searchable, whether for a search engine like Google, which has to find its way through a web of billions of pages, or a service like Spotify, which wants to keep serving its users the right music."
Layman's Talk: Entities of Interest --- Discovery in Digital Traces. David Graus
The document outlines a program that includes a committee grilling a speaker at 10:00, the committee retreating afterwards, a ceremony at 10:15, and a reception downstairs from 11:00 to 12:30.
Slides of the talk I gave at PyData Amsterdam.
Abstract:
"The FD Mediagroep collects, analyses and filters valuable and relevant information, 24/7, for an influential group of professionals, business executives and high net worth individuals. Company.info (part of FDMG) provides complete, reliable, up-to-date company information and business news about no less than 2.7 million companies and other legal entities in the Netherlands. For Company.info we continuously monitor and crawl hundreds of (online) news sources, resulting in a large archive of (Dutch) business-related news, spanning hundreds of thousands of articles. These articles are automatically enriched, by linking the profiles of companies that are mentioned in the articles, using a custom in-house entity linking framework built in Python. In this talk, I will briefly explain the entity linking task, I will detail the implementation of our custom entity linking framework, and our pipeline for crawling and enriching news articles."
De Macht van Data --- Hoe algoritmen ons leven vormgeven (The Power of Data: How algorithms shape our lives). David Graus
Slides of the introductory talk I gave at an event at De Balie: "De macht van data" on June 18th, 2017.
For a video recording of the talk see: http://graus.co/blog/mini-college-algoritmen/
Talk I gave at the Data Science Northeast Netherlands Meetup, where I detail the custom in-house entity linking framework, sentiment analysis, and entity salience scoring model we developed for Company.info, in addition to showing some example applications of our corpus of news articles linked to organization profiles.
Dynamic Collective Entity Representations for Entity Ranking. David Graus
This document proposes using collective intelligence to dynamically enrich entity representations from multiple sources like knowledge bases, anchors, tags, and tweets. It presents an adaptive ranking model that learns optimal weights for ranking features like field similarity and importance over time. An experiment on query logs shows expanding entities with different sources improves ranking and retraining the ranker with new content further enhances performance.
Dynamic Collective Entity Representations for Entity Ranking. David Graus
This document proposes using dynamic collective entity representations to improve entity ranking. It describes enriching static entity representations from knowledge bases with descriptions from dynamic sources like tweets, queries, and tags. An adaptive ranking model individually weights each description source and retrains over time using clicks. Experimental results show expanding representations and retraining the ranker improves ranking performance compared to a non-adaptive model, with different sources providing varying benefits depending on their dynamic nature and entity coverage.
David Graus presents his research on using semantic search techniques to improve information retrieval for digital forensic evidence from emails and other electronic documents. He discusses using social network analysis of communication patterns and language models of email content to predict likely recipients of emails. By combining these approaches, he is able to more accurately rank potential recipients than using either technique alone. Future work includes incorporating organizational structure and decay of communication patterns over time.
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27th. David Graus
David Graus from the University of Amsterdam gave a presentation on entity linking at the Search Engines Amsterdam conference on June 27th. He began by defining entity linking as linking mentions of entities in text to their corresponding entities in a knowledge base. He then gave an example of entity linking and discussed ranking entity candidates based on their prior probabilities like link probability and commonness. Finally, he described using both local and global features in supervised learning models to improve entity linking accuracy.
This document discusses research on applying text mining and information retrieval techniques for fact finding in regulatory investigations from electronic documents. The researchers are developing methods for semantic search in e-discovery to iteratively retrieve relevant evidence from emails, forums, and other sources by integrating structural context and extracting knowledge from unstructured text. Their current work includes using Twitter mining as a form of conversational search and entity linking to semantically enrich documents.
Semantic Annotation of the Cyttron Database. David Graus
Final Presentation for my MSc Graduation Project.
Abstract:
"Semantic annotation uses human knowledge formalized in ontologies to enrich texts, by providing structured and machine-understandable information of its content. This paper proposes an approach for automatically annotating texts of the Cyttron Scientific Image Database, using the NCI Thesaurus ontology. Several frequency-based keyword extraction algorithms were implemented and evaluated, aiming to extract important concepts and exclude less relevant ones. Furthermore, topic classification algorithms were applied to identify important concepts which do not occur in the text. The algorithms were evaluated by comparison to annotations provided by experts. Semantic networks were generated from these annotations and an ontology-based similarity metric was applied to perform the comparison. Finally the networks were visualized to provide further insights into the differences of the semantic structure generated by humans, and the algorithms."
More information: http://graus.nu/category/thesis
yourHistory - entity linking for a personalized timeline of historic events
1. [Timeline visualization: historic events (Gaza War, Britches, World War II, Berlin Wall, Woodstock, 9/11, Gulf War, BET Hiphop Awards) plotted on an axis from 1900 to 2010]
David Graus, Maria-Hendrike Peetz, Daan Odijk, Maarten de Rijke, Ork de Rooij
2. Entity Linking for a personalized timeline of historic events
• Motivation
• Method
• Part I: Fetch Candidate Historic Events
• Part II: Generate User Profile
• Part III: Matching Events to User Profile
• Part IV: Scoring & Ranking Events
• Future Work
3. • "[…] To design and build innovative and robust prototypes and demos for tools that analyse and/or integrate open web data for educational purposes."
21. Access Facebook profile
MY FACEBOOK PROFILE: BIO, POSTS, LIKES
{
  "id": "1183880085",
  "likes": {
    "data": [
      { "category": "Musician/band", "created_time": "2013-10-27T11:37:51+0000", "name": "NAS", "id": "113591595350795" },
      { "category": "Company", "created_time": "2013-10-17T07:45:36+0000", "name": "Infinibase", "id": "573216229380347" },
      { "category": "Magazine", "created_time": "2013-10-04T13:55:10+0000", "name": "New Scientist NL", "id": "369158433181445" },
22. Extract text attributes
{
  "id": "1183880085",
  "likes": {
    "data": [
      { "category": "Musician/band", "created_time": "2013-10-27T11:37:51+0000", "name": "NAS", "id": "113591595350795" },
      { "category": "Company", "created_time": "2013-10-17T07:45:36+0000", "name": "Infinibase", "id": "573216229380347" },
      { "category": "Magazine", "created_time": "2013-10-04T13:55:10+0000", "name": "New Scientist NL", "id": "369158433181445" },
      { "category": "Tv show", "created_time": "2010-05-09T01:06:27+0000", "name": "The Wire", "id": "5991693871" }
    ]
  }
}
[Extracted text attributes:] Story, Omroep Maxim, Gamer01, Breaking Bad, AT5, Mad Men, The Wire, Monty Python's Flying Circus, Flight of the Conchords, Donnie Darko, Flevopark Film Festival, Do The Right Thing, A Clockwork Orange, Wild Style, Princess Mononoke, The Fountain, Pi, Northfork, La Haine, Zen and the Art of Motorcycle Maintenance, Moon Palace, The Fountainhead, The Wind-Up Bird Chronicle, Wu-Tang, J.Cole, NAS, Pusha T, ASAP Rocky, Ab-Soul, Chance The Rapper, Cannibal Ox, Bonobo, Aesop Rock, Boards Of Canada, Jurassic 5, GREMS, Quasimoto, Strange Journey Volume Three, Drop, Velvet, MODESELEKTOR, IAM, Derek, The Onion, Imgur, De Speld, Wu-Tang
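The extraction step above can be sketched in Python. A minimal sketch, assuming the Graph API response shape shown on the slide; the `extract_text_attributes` helper is illustrative, not the deck's actual code:

```python
import json

# Hypothetical excerpt mirroring the Graph API response shown on the slide.
raw = """
{
  "id": "1183880085",
  "likes": {
    "data": [
      {"category": "Musician/band", "created_time": "2013-10-27T11:37:51+0000",
       "name": "NAS", "id": "113591595350795"},
      {"category": "Company", "created_time": "2013-10-17T07:45:36+0000",
       "name": "Infinibase", "id": "573216229380347"},
      {"category": "Magazine", "created_time": "2013-10-04T13:55:10+0000",
       "name": "New Scientist NL", "id": "369158433181445"}
    ]
  }
}
"""

def extract_text_attributes(profile_json):
    """Collect the textual attributes of a profile: here, the names of liked pages."""
    profile = json.loads(profile_json)
    return [like["name"] for like in profile.get("likes", {}).get("data", [])]

print(extract_text_attributes(raw))  # ['NAS', 'Infinibase', 'New Scientist NL']
```

The same traversal would apply to bio and post fields; only the JSON path changes.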
23. [Extracted text attributes, continued:] ASAP Rocky, Ab-Soul, Chance The Rapper, Cannibal Ox, Bonobo, Aesop Rock, Boards Of Canada, Jurassic 5, GREMS, Quasimoto, Strange Journey Volume Three, Drop, Velvet, MODESELEKTOR, IAM, Derek, The Onion, Imgur, De Speld, Wu-Tang, J.Cole, I Am Fucking Ambivalent About Science, NAS, Pusha T, ASAP Rocky, Chrietitie, Infinibase, Marktplaatspoëzie, Jeannette Span: Spelen
24. Entity Linking
• Given a Knowledge Base
• Link mentions of entities (or concepts) to their referent entities
25. Entity Linking
• From Wikipedia: extract anchor texts (words used to link to Wikipedia pages)
• For each n-gram n ↔ Wikipedia page W, estimate:
• the probability of using n-gram n to refer to Wikipedia page W
26. Entity Linking Example
Link Probability
"Nas" occurs 2,475x in Wikipedia:
• is anchor: 1,723x
• is no anchor: 752x
27. Entity Linking Example
Link Probability
"Nas" occurs 2,475x in Wikipedia:
• is anchor: 1723/2475 = 69.6%
• is no anchor: 752/2475 = 30.4%
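The link-probability computation above is a single ratio, anchor occurrences over total occurrences. A minimal sketch (function name is illustrative):

```python
def link_probability(anchor_count, total_count):
    """Fraction of a phrase's occurrences in Wikipedia that appear as anchor text."""
    return anchor_count / total_count

# "Nas" occurs 2475x in Wikipedia: 1723x as anchor text, 752x as plain text.
print(f"{link_probability(1723, 2475):.1%}")  # 69.6%
```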
28. Entity Linking Example
Commonness
• "Nas" is used to refer to:
• http://en.wikipedia.org/wiki/Nas
• http://en.wikipedia.org/wiki/Naas
• http://en.wikipedia.org/wiki/Nås
• http://en.wikipedia.org/wiki/Nas (Ikaria)
• http://en.wikipedia.org/wiki/Untitled Nas album
29. Entity Linking Example
Commonness
• "Nas" is used to refer to:
• http://en.wikipedia.org/wiki/Nas 14x
• http://en.wikipedia.org/wiki/Naas 4x
• http://en.wikipedia.org/wiki/Nås 3x
• http://en.wikipedia.org/wiki/Nas (Ikaria) 2x
• http://en.wikipedia.org/wiki/Untitled Nas album 2x
30. Entity Linking Example
Commonness
• "Nas" is used to refer to:
• http://en.wikipedia.org/wiki/Nas 14/25 = 56%
• http://en.wikipedia.org/wiki/Naas 4/25 = 16%
• http://en.wikipedia.org/wiki/Nås 3/25 = 12%
• http://en.wikipedia.org/wiki/Nas (Ikaria) 2/25 = 8%
• http://en.wikipedia.org/wiki/Untitled Nas album 2/25 = 8%
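Commonness follows the same pattern per candidate target: each page's share of all 25 links for the mention "Nas" (so 14/25 = 56%, 4/25 = 16%, and so on). A minimal sketch, with the counts taken from the slides:

```python
from collections import Counter

# How often anchors with the text "Nas" point at each candidate page (from the slides).
link_targets = Counter({
    "Nas": 14,
    "Naas": 4,
    "Nås": 3,
    "Nas (Ikaria)": 2,
    "Untitled Nas album": 2,
})

def commonness(targets):
    """Commonness of each candidate: its link count over all links for the mention."""
    total = sum(targets.values())  # 25
    return {page: count / total for page, count in targets.items()}

for page, p in commonness(link_targets).items():
    print(f"{page}: {p:.0%}")
```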
32. [List of attributes shown on the slide:] AT5, Mad Men, The Wire, Monty Python's Flying Circus, Flight of the Conchords, Donnie Darko, Flevopark Film Festival, Do The Right Thing, A Clockwork Orange, Wild Style, Princess Mononoke, The Fountain, Pi, Northfork, La Haine, Zen and the Art of Motorcycle Maintenance, Moon Palace, The Fountainhead, The Wind-Up Bird Chronicle, Wu-Tang, J.Cole
53. Future Work
• Log interactions
• Interpret clicks as (implicit) feedback:
  • Click on Event: user is interested
  • No click on Event: user is not
• Learn scoring & ranking functions
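The click-feedback idea above could be prototyped with a simple update rule. This is purely illustrative; `update_interest` and the learning rate are assumptions, not the learned scoring and ranking functions the slide refers to:

```python
def update_interest(score, clicked, lr=0.1):
    """Move an interest score toward 1.0 on a click, toward 0.0 on a skip
    (simple exponential smoothing; a stand-in for a learned function)."""
    target = 1.0 if clicked else 0.0
    return score + lr * (target - score)

score = 0.5
score = update_interest(score, clicked=True)   # 0.55
score = update_interest(score, clicked=False)  # 0.495
print(round(score, 3))
```

Events whose types accumulate higher scores would then be ranked higher in the next timeline.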
54. Thank you! Questions?
Try yourHistory: http://apps.facebook.com/yourHistory
See our poster: #98
David Graus
d.p.graus@uva.nl
@dvdgrs