SlideShare a Scribd company logo
CORE: Context-Aware Open Relation Extraction
with Factorization Machines
Fabio Petroni
Luciano Del Corro Rainer Gemulla
Open relation extraction
I Open relation extraction is the task of extracting new facts for
a potentially unbounded set of relations from various sources
natural
language
text
knowledge
bases
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 2 of 21
Input data: facts from natural language text
Enrico Fermi was a
professor in theoretical
physics at Sapienza
University of Rome.
"professor at"(Fermi,Sapienza)
tuple
sub objrel
surface fact
open information
extractor
extract all
facts in textsurface relation
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 3 of 21
Input data: facts from knowledge bases
natural
language
text
Fermi
Sapienza
employee(Fermi,Sapienza)
KB fact
employee
KB relation
"professor at"(Fermi,Sapienza)
surface fact
entity link
e.g., string match
heuristic
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 4 of 21
Relation extraction techniques taxonomy
distant
supervision
relation
extraction
tensor
completion
matrix
completion
RESCAL PITF NFE CORE
relation
clustering
set of predefined
relations
"black and white"
approach
limited scalability with the number
of relations; large prediction space
latent factors
models
openclose
in-KB in-KB
out of-KB
restricted prediction space
(Nickel et al., 2011) (Drumond et al., 2012) (Petroni et al., 2015)(Riedel et al., 2013)
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 5 of 21
Matrix completion for open relation extraction
(Caesar,Rome)
(Fermi,Rome)
(Fermi,Sapienza)
(de Blasio,NY)
1
1
1
1
1
KB relationsurface relation
employeeborn in professor at mayor of
tuples x relations
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 6 of 21
Matrix completion for open relation extraction
(Caesar,Rome)
(Fermi,Rome)
(Fermi,Sapienza)
(de Blasio,NY)
1
1
1
1
1
KB relationsurface relation
employeeborn in professor at mayor of
tuples x relations
?
? ??
? ??
?
? ? ?
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 6 of 21
Matrix factorization
I learn latent semantic representations of tuples and relations
tuple latent
factor vector
relation latent
factor vector
dot product
I leverage latent representations to predict new facts
professor at
related with science0.8
(Fermi,Sapienza)
0.9
-0.5 -0.3 related with sport
I in real applications latent factors are uninterpretable
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 7 of 21
Matrix factorization
CORE integrates contextual
information into such
models to improve
prediction performance
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 7 of 21
Contextual information
Tom Peloso joined
Modest Mouse to record
their fifth studio album.
person
organization
surface relation
"join"(Peloso,Modest Mouse)
unspecific relation
entity types
article topic
words record album
Contextual information
named entity
recognizer
label entity
with coarse-
grained type
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 8 of 21
Contextual information
Tom Peloso joined
Modest Mouse to record
their fifth studio album.
person
organization
surface relation
"join"(Peloso,Modest Mouse)
unspecific relation
entity types
article topic
words record album
Contextual information
How to incorporate contextual
information within the model?
How to incorporate contextual
information within the model?
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 8 of 21
CORE - latent representations of variables
I associates latent representations fv with each variable v 2 V
Peloso
(Peloso,Modest Mouse)
Modest Mouse
join
person
organization
record
album
tuple
relation
entities
context
latent
factor
vectors
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 9 of 21
CORE - modeling facts
1 0 0 1 0 0 0.5 0.5 0 0 0 1 0 1
0 1 0 0 0 1 0 0 0.5 0.5 1 0 1 0
1 0 0 0 1 0 0 0.5 0.5 0 0 1 0.6 0.4
0 0 1 0 0 1 0 0 0.5 0.5 1 0 1 0
“born in”(x,y) employee(x,y) Caesar,Rome Fermi,Rome Fermi,Sapienza Caesar Rome Fermi Sapienza person,
organization
person,
location
physics history
relations tuples entities tuple types tuple topics
x1
x2
x3
x4
Surface KB Context
…
“professor at”(x,y)
I models the input data in terms of a matrix in which each row
corresponds to a fact x and each column to a variable v
I groups columns according to the type of the variables
I in each row the values of each column group sum up to unity
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 10 of 21
CORE - modeling context
1 0 0 1 0 0 0.5 0.5 0 0 0 1 0 1
0 1 0 0 0 1 0 0 0.5 0.5 1 0 1 0
1 0 0 0 1 0 0 0.5 0.5 0 0 1 0.6 0.4
0 0 1 0 0 1 0 0 0.5 0.5 1 0 1 0
“born in”(x,y) employee(x,y) Caesar,Rome Fermi,Rome Fermi,Sapienza Caesar Rome Fermi Sapienza person,
organization
person,
location
physics history
relations tuples entities tuple types tuple topics
x1
x2
x3
x4
Surface KB Context
…
“professor at”(x,y)
I aggregates and normalizes contextual information by tuple
B a fact can be observed multiple times with di↵erent context
B there is no context for new facts (never observed in input)
I this approach allows us to provide comprehensive contextual
information for both observed and unobserved facts
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 11 of 21
CORE - factorization model
1 0 0 1 0 0 0.5 0.5 0 0 0 1 0 1
0 1 0 0 0 1 0 0 0.5 0.5 1 0 1 0
1 0 0 0 1 0 0 0.5 0.5 0 0 1 0.6 0.4
0 0 1 0 0 1 0 0 0.5 0.5 1 0 1 0
“born in”(x,y) employee(x,y) Caesar,Rome Fermi,Rome Fermi,Sapienza Caesar Rome Fermi Sapienza person,
organization
person,
location
physics history
relations tuples entities tuple types tuple topics
x1
x2
x3
x4
Surface KB Context
…
“professor at”(x,y)
I uses factorization machines as underlying framework
I associates a score s(x) with each fact x
s(x) =
X
v12V
X
v22V {v1}
xv1 xv2 f T
v1
fv2
I weighted pairwise interactions of latent factor vectors
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 12 of 21
CORE - prediction
produce a ranked list of tuples for each relation
goal
I rank reflects the likelihood that the corresponding fact is true
I to generate this ranked list:
B fix a relation r
B retrieve all tuples t, s.t. the fact r(t) is not observed
B add tuple context
B rank unobserved facts by their scores
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 13 of 21
CORE - parameter estimation
I parameters: ⇥ = { bv , fv | v 2 V }
I all our observations are positive, no negative training data
I Bayesian personalized ranking, open-world assumption
professor at
location
tuple entities tuple contextfact
(Fermi,Sapienza)
Fermi
Sapienza organization
physics
person
x
(Caesar,Rome)
professor at Caesar
locationRome
history
tuple entities tuple contextfact
person
x-sampled negative evidenceobserved fact
I pairwise approach, x is more likely to be true than x-
maximize
X
x
f (s(x) s(x-))
I stochastic gradient ascent
⇥ ⇥ + ⌘r⇥ ( )
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 14 of 21
Experiments - dataset
440k facts
extracted from
corpus
15k facts
from
entity mentions
linked using
string matching
I Contextual information
article metadata bag-of-word
sentences where
the fact has been
extracted
entity type
person
organization
location
miscellaneous
news desk (e.g., foreign desk)
descriptors (e.g., finances)
online section (e.g., sports)
section (e.g., a, d)
publication year m t w
I letters to indicate contextual information considered
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 15 of 21
Experiments - methodology
I we consider (to keep experiments feasible):
10k tuples 10 surface relations19 Freebase relations
I for each relation and method:
B we rank the tuples subsample
B we consider the top-100 predictions and label them manually
I evaluation metrics:
number of true facts MAP (quality of the ranking)
I methods:
B PITF, tensor factorization method
B NFE, matrix completion method (context-agnostic)
B CORE, uses relations, tuples and entities as variables
B CORE+m, +t, +w, +mt, +mtw
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 16 of 21
Results - Freebase relations
Relation # PITF NFE CORE CORE+m CORE+t CORE+w CORE+mt CORE+mtw
person/company 208 70 (0.47) 92 (0.81) 91 (0.83) 90 (0.84) 91 (0.87) 92 (0.87) 95 (0.93) 96 (0.94)
person/place of birth 117 1 (0.0) 92 (0.9) 90 (0.88) 92 (0.9) 92 (0.9) 89 (0.87) 93 (0.9) 92 (0.9)
location/containedby 102 7 (0.0) 63 (0.47) 62 (0.47) 63 (0.46) 61 (0.47) 61 (0.44) 62 (0.49) 68 (0.55)
parent/child 88 9 (0.01) 64 (0.6) 64 (0.56) 64 (0.59) 64 (0.62) 64 (0.57) 67 (0.67) 68 (0.63)
person/place of death 71 1 (0.0) 67 (0.93) 67 (0.92) 69 (0.94) 67 (0.93) 67 (0.92) 69 (0.94) 67 (0.92)
person/parents 67 20 (0.1) 51 (0.64) 52 (0.62) 51 (0.61) 49 (0.64) 47 (0.6) 53 (0.67) 53 (0.65)
author/works written 65 24 (0.08) 45 (0.59) 49 (0.62) 51 (0.69) 50 (0.68) 50 (0.68) 51 (0.7) 52 (0.67)
person/nationality 61 21 (0.08) 25 (0.19) 27 (0.17) 28 (0.2) 26 (0.2) 29 (0.19) 27 (0.18) 27 (0.21)
neighbor./neighborhood of 39 3 (0.0) 24 (0.44) 23 (0.45) 26 (0.5) 27 (0.47) 27 (0.49) 30 (0.51) 30 (0.52)
...
Average MAP100
# 0.09 0.46 0.47 0.49 0.47 0.49 0.49 0.51
Weighted Average MAP100
# 0.14 0.64 0.64 0.66 0.67 0.66 0.70 0.70
0.5 0.55 0.6 0.65 0.7
NFE
CORE
CORE+m
CORE+t
CORE+w
CORE+mt
CORE+mtw
Weighted Average MAP
0.64
0.64
0.66
0.67
0.66
0.70
0.70
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 17 of 21
Results - surface relations
Relation # PITF NFE CORE CORE+m CORE+t CORE+w CORE+mt CORE+mtw
head 162 34 (0.18) 80 (0.66) 83 (0.66) 82 (0.63) 76 (0.57) 77 (0.57) 83 (0.69) 88 (0.73)
scientist 144 44 (0.17) 76 (0.6) 74 (0.55) 73 (0.56) 74 (0.6) 73 (0.59) 78 (0.66) 78 (0.69)
base 133 10 (0.01) 85 (0.71) 86 (0.71) 86 (0.78) 88 (0.79) 85 (0.75) 83 (0.76) 89 (0.8)
visit 118 4 (0.0) 73 (0.6) 75 (0.61) 76 (0.64) 80 (0.68) 74 (0.64) 75 (0.66) 82 (0.74)
attend 92 11 (0.02) 65 (0.58) 64 (0.59) 65 (0.63) 62 (0.6) 66 (0.63) 62 (0.58) 69 (0.64)
adviser 56 2 (0.0) 42 (0.56) 47 (0.58) 44 (0.58) 43 (0.59) 45 (0.63) 43 (0.53) 44 (0.63)
criticize 40 5 (0.0) 31 (0.66) 33 (0.62) 33 (0.7) 33 (0.67) 33 (0.61) 35 (0.69) 37 (0.69)
support 33 3 (0.0) 19 (0.27) 22 (0.28) 18 (0.21) 19 (0.28) 22 (0.27) 23 (0.27) 21 (0.27)
praise 5 0 (0.0) 2 (0.0) 2 (0.01) 4 (0.03) 3 (0.01) 3 (0.02) 5 (0.03) 2 (0.01)
vote 3 2 (0.01) 3 (0.63) 3 (0.63) 3 (0.32) 3 (0.49) 3 (0.51) 3 (0.59) 3 (0.64)
Average MAP100
# 0.04 0.53 0.53 0.51 0.53 0.53 0.55 0.59
Weighted Average MAP100
# 0.08 0.62 0.61 0.63 0.63 0.61 0.65 0.70
0.5 0.6 0.7
NFE
CORE
CORE+m
CORE+t
CORE+w
CORE+mt
CORE+mtw
Weighted Average MAP
0.62
0.61
0.63
0.63
0.61
0.65
0.70
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 18 of 21
Anecdotal results
author(x,y)
ranked list of tuples
1 (Winston Groom, Forrest Gump)
2 (D. M. Thomas, White Hotel)
3 (Roger Rosenblatt, Life Itself)
4 (Edmund White, Skinned Alive)
5 (Peter Manso, Brando: The Biography)
similar relations
0.98 “reviews x by y”(x,y)
0.97 “book by”(x,y)
0.95 “author of”(x,y)
0.95 ” ‘s novel”(x,y)
0.95 “ ‘s book”(x,y)
similar relations
0.87 “scientist”(x,y)
0.84 “scientist with”(x,y)
0.80 “professor at”(x,y)
0.79 “scientist for”(x,y)
0.78 “neuroscientist at”(x,y)
ranked list of tuples
1 (Riordan Roett, Johns Hopkins University)
2 (Dr. R. M. Roberts, University of Missouri)
3 (Linda Mayes, Yale University)
4 (Daniel T. Jones, Cardiff Business School)
5 (Russell Ross, University of Iowa)
“scientist at”(x,y)
I semantic similarity of relations is one aspect of our model
I similar relations treated di↵erently in di↵erent contexts
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 19 of 21
Conclusion
I CORE, a matrix factorization model for open relation
extraction that incorporates contextual information
I based on factorization machines and open-world assumption
I extensible model, additional contextual information can be
integrated when available
I experimental study suggests that exploiting context can
significantly improve prediction performance
I Source code, datasets, and supporting material are available
at https://github.com/fabiopetroni/CORE
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 20 of 21
Thank you!
Questions?
Fabio Petroni
Sapienza University of Rome, Italy
Current position:
PhD Student in Engineering in Computer Science
Research Interests:
data mining, machine learning, big data
petroni@dis.uniroma1.it
EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 21 of 21

More Related Content

Similar to CORE: Context-Aware Open Relation Extraction with Factorization Machines

3_MLE_printable.pdf
3_MLE_printable.pdf3_MLE_printable.pdf
3_MLE_printable.pdf
Elio Laureano
 
Session II - Estimation methods and accuracy - Brunero Liseo, Discussion
Session II - Estimation methods and accuracy - Brunero Liseo, Discussion Session II - Estimation methods and accuracy - Brunero Liseo, Discussion
Session II - Estimation methods and accuracy - Brunero Liseo, Discussion
Istituto nazionale di statistica
 
Recommandation sociale : filtrage collaboratif et par le contenu
Recommandation sociale : filtrage collaboratif et par le contenuRecommandation sociale : filtrage collaboratif et par le contenu
Recommandation sociale : filtrage collaboratif et par le contenu
Patrice Bellot - Aix-Marseille Université / CNRS (LIS, INS2I)
 
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML
 
slides of ABC talk at i-like workshop, Warwick, May 16
slides of ABC talk at i-like workshop, Warwick, May 16slides of ABC talk at i-like workshop, Warwick, May 16
slides of ABC talk at i-like workshop, Warwick, May 16
Christian Robert
 
Class9_PCA_final.ppt
Class9_PCA_final.pptClass9_PCA_final.ppt
Class9_PCA_final.ppt
MaTruongThanh002937
 
Nimrita koul Machine Learning
Nimrita koul  Machine LearningNimrita koul  Machine Learning
Nimrita koul Machine Learning
Nimrita Koul
 
Lecture20 xing
Lecture20 xingLecture20 xing
Lecture20 xing
Tianlu Wang
 
Building WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spacesBuilding WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spaces
Pierpaolo Basile
 
Presentation of volesti in eRum 2020
Presentation of volesti in eRum 2020 Presentation of volesti in eRum 2020
Presentation of volesti in eRum 2020
Apostolos Chalkis
 
Evoknow17 Large Scale Problems in Practice
Evoknow17 Large Scale Problems in PracticeEvoknow17 Large Scale Problems in Practice
Evoknow17 Large Scale Problems in Practice
Fabio Caraffini
 
Link Discovery Tutorial Part II: Accuracy
Link Discovery Tutorial Part II: AccuracyLink Discovery Tutorial Part II: Accuracy
Link Discovery Tutorial Part II: Accuracy
Holistic Benchmarking of Big Linked Data
 
Csit77406
Csit77406Csit77406
Csit77406
csandit
 
Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...
tuxette
 
Random Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian NetworksRandom Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian Networks
University of Nantes
 
Statistical software for Sampling from Finite Populations: an analysis using ...
Statistical software for Sampling from Finite Populations: an analysis using ...Statistical software for Sampling from Finite Populations: an analysis using ...
Statistical software for Sampling from Finite Populations: an analysis using ...
michele de meo
 
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdfStatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
Anna Carbone
 
Statistics 101 for System Administrators
Statistics 101 for System AdministratorsStatistics 101 for System Administrators
Statistics 101 for System Administrators
Roberto Polli
 
Semantics
SemanticsSemantics
A Systematic Approach To Probabilistic Pointer Analysis
A Systematic Approach To Probabilistic Pointer AnalysisA Systematic Approach To Probabilistic Pointer Analysis
A Systematic Approach To Probabilistic Pointer Analysis
Monica Franklin
 

Similar to CORE: Context-Aware Open Relation Extraction with Factorization Machines (20)

3_MLE_printable.pdf
3_MLE_printable.pdf3_MLE_printable.pdf
3_MLE_printable.pdf
 
Session II - Estimation methods and accuracy - Brunero Liseo, Discussion
Session II - Estimation methods and accuracy - Brunero Liseo, Discussion Session II - Estimation methods and accuracy - Brunero Liseo, Discussion
Session II - Estimation methods and accuracy - Brunero Liseo, Discussion
 
Recommandation sociale : filtrage collaboratif et par le contenu
Recommandation sociale : filtrage collaboratif et par le contenuRecommandation sociale : filtrage collaboratif et par le contenu
Recommandation sociale : filtrage collaboratif et par le contenu
 
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
 
slides of ABC talk at i-like workshop, Warwick, May 16
slides of ABC talk at i-like workshop, Warwick, May 16slides of ABC talk at i-like workshop, Warwick, May 16
slides of ABC talk at i-like workshop, Warwick, May 16
 
Class9_PCA_final.ppt
Class9_PCA_final.pptClass9_PCA_final.ppt
Class9_PCA_final.ppt
 
Nimrita koul Machine Learning
Nimrita koul  Machine LearningNimrita koul  Machine Learning
Nimrita koul Machine Learning
 
Lecture20 xing
Lecture20 xingLecture20 xing
Lecture20 xing
 
Building WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spacesBuilding WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spaces
 
Presentation of volesti in eRum 2020
Presentation of volesti in eRum 2020 Presentation of volesti in eRum 2020
Presentation of volesti in eRum 2020
 
Evoknow17 Large Scale Problems in Practice
Evoknow17 Large Scale Problems in PracticeEvoknow17 Large Scale Problems in Practice
Evoknow17 Large Scale Problems in Practice
 
Link Discovery Tutorial Part II: Accuracy
Link Discovery Tutorial Part II: AccuracyLink Discovery Tutorial Part II: Accuracy
Link Discovery Tutorial Part II: Accuracy
 
Csit77406
Csit77406Csit77406
Csit77406
 
Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...
 
Random Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian NetworksRandom Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian Networks
 
Statistical software for Sampling from Finite Populations: an analysis using ...
Statistical software for Sampling from Finite Populations: an analysis using ...Statistical software for Sampling from Finite Populations: an analysis using ...
Statistical software for Sampling from Finite Populations: an analysis using ...
 
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdfStatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
 
Statistics 101 for System Administrators
Statistics 101 for System AdministratorsStatistics 101 for System Administrators
Statistics 101 for System Administrators
 
Semantics
SemanticsSemantics
Semantics
 
A Systematic Approach To Probabilistic Pointer Analysis
A Systematic Approach To Probabilistic Pointer AnalysisA Systematic Approach To Probabilistic Pointer Analysis
A Systematic Approach To Probabilistic Pointer Analysis
 

Recently uploaded

KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
sachin chaurasia
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
JamalHussainArman
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
co23btech11018
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
RadiNasr
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
Dr Ramhari Poudyal
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball playEric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
enizeyimana36
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
IJNSA Journal
 
Heat Resistant Concrete Presentation ppt
Heat Resistant Concrete Presentation pptHeat Resistant Concrete Presentation ppt
Heat Resistant Concrete Presentation ppt
mamunhossenbd75
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
HODECEDSIET
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
gerogepatton
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
171ticu
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
NazakatAliKhoso2
 

Recently uploaded (20)

KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball playEric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
 
Heat Resistant Concrete Presentation ppt
Heat Resistant Concrete Presentation pptHeat Resistant Concrete Presentation ppt
Heat Resistant Concrete Presentation ppt
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
 

CORE: Context-Aware Open Relation Extraction with Factorization Machines

  • 1. CORE: Context-Aware Open Relation Extraction with Factorization Machines Fabio Petroni Luciano Del Corro Rainer Gemulla
  • 2. Open relation extraction I Open relation extraction is the task of extracting new facts for a potentially unbounded set of relations from various sources natural language text knowledge bases EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 2 of 21
  • 3. Input data: facts from natural language text Enrico Fermi was a professor in theoretical physics at Sapienza University of Rome. "professor at"(Fermi,Sapienza) tuple sub objrel surface fact open information extractor extract all facts in textsurface relation EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 3 of 21
  • 4. Input data: facts from knowledge bases natural language text Fermi Sapienza employee(Fermi,Sapienza) KB fact employee KB relation "professor at"(Fermi,Sapienza) surface fact entity link e.g., string match heuristic EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 4 of 21
  • 5. Relation extraction techniques taxonomy distant supervision relation extraction tensor completion matrix completion RESCAL PITF NFE CORE relation clustering set of predefined relations "black and white" approach limited scalability with the number of relations; large prediction space latent factors models openclose in-KB in-KB out of-KB restricted prediction space (Nickel et al., 2011) (Drumond et al., 2012) (Petroni et al., 2015)(Riedel et al., 2013) EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 5 of 21
  • 6. Matrix completion for open relation extraction (Caesar,Rome) (Fermi,Rome) (Fermi,Sapienza) (de Blasio,NY) 1 1 1 1 1 KB relationsurface relation employeeborn in professor at mayor of tuples x relations EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 6 of 21
  • 7. Matrix completion for open relation extraction (Caesar,Rome) (Fermi,Rome) (Fermi,Sapienza) (de Blasio,NY) 1 1 1 1 1 KB relationsurface relation employeeborn in professor at mayor of tuples x relations ? ? ?? ? ?? ? ? ? ? EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 6 of 21
  • 8. Matrix factorization I learn latent semantic representations of tuples and relations tuple latent factor vector relation latent factor vector dot product I leverage latent representations to predict new facts professor at related with science0.8 (Fermi,Sapienza) 0.9 -0.5 -0.3 related with sport I in real applications latent factors are uninterpretable EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 7 of 21
  • 9. Matrix factorization CORE integrates contextual information into such models to improve prediction performance EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 7 of 21
  • 10. Contextual information Tom Peloso joined Modest Mouse to record their fifth studio album. person organization surface relation "join"(Peloso,Modest Mouse) unspecific relation entity types article topic words record album Contextual information named entity recognizer label entity with coarse- grained type EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 8 of 21
  • 11. Contextual information Tom Peloso joined Modest Mouse to record their fifth studio album. person organization surface relation "join"(Peloso,Modest Mouse) unspecific relation entity types article topic words record album Contextual information How to incorporate contextual information within the model? How to incorporate contextual information within the model? EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 8 of 21
  • 12. CORE - latent representations of variables I associates latent representations fv with each variable v 2 V Peloso (Peloso,Modest Mouse) Modest Mouse join person organization record album tuple relation entities context latent factor vectors EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 9 of 21
  • 13. CORE - modeling facts 1 0 0 1 0 0 0.5 0.5 0 0 0 1 0 1 0 1 0 0 0 1 0 0 0.5 0.5 1 0 1 0 1 0 0 0 1 0 0 0.5 0.5 0 0 1 0.6 0.4 0 0 1 0 0 1 0 0 0.5 0.5 1 0 1 0 “born in”(x,y) employee(x,y) Caesar,Rome Fermi,Rome Fermi,Sapienza Caesar Rome Fermi Sapienza person, organization person, location physics history relations tuples entities tuple types tuple topics x1 x2 x3 x4 Surface KB Context … “professor at”(x,y) I models the input data in terms of a matrix in which each row corresponds to a fact x and each column to a variable v I groups columns according to the type of the variables I in each row the values of each column group sum up to unity EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 10 of 21
  • 14. CORE - modeling context 1 0 0 1 0 0 0.5 0.5 0 0 0 1 0 1 0 1 0 0 0 1 0 0 0.5 0.5 1 0 1 0 1 0 0 0 1 0 0 0.5 0.5 0 0 1 0.6 0.4 0 0 1 0 0 1 0 0 0.5 0.5 1 0 1 0 “born in”(x,y) employee(x,y) Caesar,Rome Fermi,Rome Fermi,Sapienza Caesar Rome Fermi Sapienza person, organization person, location physics history relations tuples entities tuple types tuple topics x1 x2 x3 x4 Surface KB Context … “professor at”(x,y) I aggregates and normalizes contextual information by tuple B a fact can be observed multiple times with di↵erent context B there is no context for new facts (never observed in input) I this approach allows us to provide comprehensive contextual information for both observed and unobserved facts EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 11 of 21
  • 15. CORE - factorization model 1 0 0 1 0 0 0.5 0.5 0 0 0 1 0 1 0 1 0 0 0 1 0 0 0.5 0.5 1 0 1 0 1 0 0 0 1 0 0 0.5 0.5 0 0 1 0.6 0.4 0 0 1 0 0 1 0 0 0.5 0.5 1 0 1 0 “born in”(x,y) employee(x,y) Caesar,Rome Fermi,Rome Fermi,Sapienza Caesar Rome Fermi Sapienza person, organization person, location physics history relations tuples entities tuple types tuple topics x1 x2 x3 x4 Surface KB Context … “professor at”(x,y) I uses factorization machines as underlying framework I associates a score s(x) with each fact x s(x) = X v12V X v22V {v1} xv1 xv2 f T v1 fv2 I weighted pairwise interactions of latent factor vectors EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 12 of 21
  • 16. CORE - prediction produce a ranked list of tuples for each relation goal I rank reflects the likelihood that the corresponding fact is true I to generate this ranked list: B fix a relation r B retrieve all tuples t, s.t. the fact r(t) is not observed B add tuple context B rank unobserved facts by their scores EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 13 of 21
  • 17. CORE - parameter estimation I parameters: ⇥ = { bv , fv | v 2 V } I all our observations are positive, no negative training data I Bayesian personalized ranking, open-world assumption professor at location tuple entities tuple contextfact (Fermi,Sapienza) Fermi Sapienza organization physics person x (Caesar,Rome) professor at Caesar locationRome history tuple entities tuple contextfact person x-sampled negative evidenceobserved fact I pairwise approach, x is more likely to be true than x- maximize X x f (s(x) s(x-)) I stochastic gradient ascent ⇥ ⇥ + ⌘r⇥ ( ) EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 14 of 21
  • 18. Experiments - dataset 440k facts extracted from corpus 15k facts from entity mentions linked using string matching I Contextual information article metadata bag-of-word sentences where the fact has been extracted entity type person organization location miscellaneous news desk (e.g., foreign desk) descriptors (e.g., finances) online section (e.g., sports) section (e.g., a, d) publication year m t w I letters to indicate contextual information considered EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 15 of 21
  • 19. Experiments - methodology I we consider (to keep experiments feasible): 10k tuples 10 surface relations19 Freebase relations I for each relation and method: B we rank the tuples subsample B we consider the top-100 predictions and label them manually I evaluation metrics: number of true facts MAP (quality of the ranking) I methods: B PITF, tensor factorization method B NFE, matrix completion method (context-agnostic) B CORE, uses relations, tuples and entities as variables B CORE+m, +t, +w, +mt, +mtw EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 16 of 21
  • 20. Results - Freebase relations Relation # PITF NFE CORE CORE+m CORE+t CORE+w CORE+mt CORE+mtw person/company 208 70 (0.47) 92 (0.81) 91 (0.83) 90 (0.84) 91 (0.87) 92 (0.87) 95 (0.93) 96 (0.94) person/place of birth 117 1 (0.0) 92 (0.9) 90 (0.88) 92 (0.9) 92 (0.9) 89 (0.87) 93 (0.9) 92 (0.9) location/containedby 102 7 (0.0) 63 (0.47) 62 (0.47) 63 (0.46) 61 (0.47) 61 (0.44) 62 (0.49) 68 (0.55) parent/child 88 9 (0.01) 64 (0.6) 64 (0.56) 64 (0.59) 64 (0.62) 64 (0.57) 67 (0.67) 68 (0.63) person/place of death 71 1 (0.0) 67 (0.93) 67 (0.92) 69 (0.94) 67 (0.93) 67 (0.92) 69 (0.94) 67 (0.92) person/parents 67 20 (0.1) 51 (0.64) 52 (0.62) 51 (0.61) 49 (0.64) 47 (0.6) 53 (0.67) 53 (0.65) author/works written 65 24 (0.08) 45 (0.59) 49 (0.62) 51 (0.69) 50 (0.68) 50 (0.68) 51 (0.7) 52 (0.67) person/nationality 61 21 (0.08) 25 (0.19) 27 (0.17) 28 (0.2) 26 (0.2) 29 (0.19) 27 (0.18) 27 (0.21) neighbor./neighborhood of 39 3 (0.0) 24 (0.44) 23 (0.45) 26 (0.5) 27 (0.47) 27 (0.49) 30 (0.51) 30 (0.52) ... Average MAP100 # 0.09 0.46 0.47 0.49 0.47 0.49 0.49 0.51 Weighted Average MAP100 # 0.14 0.64 0.64 0.66 0.67 0.66 0.70 0.70 0.5 0.55 0.6 0.65 0.7 NFE CORE CORE+m CORE+t CORE+w CORE+mt CORE+mtw Weighted Average MAP 0.64 0.64 0.66 0.67 0.66 0.70 0.70 EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 17 of 21
  • 21. Results - surface relations Relation # PITF NFE CORE CORE+m CORE+t CORE+w CORE+mt CORE+mtw head 162 34 (0.18) 80 (0.66) 83 (0.66) 82 (0.63) 76 (0.57) 77 (0.57) 83 (0.69) 88 (0.73) scientist 144 44 (0.17) 76 (0.6) 74 (0.55) 73 (0.56) 74 (0.6) 73 (0.59) 78 (0.66) 78 (0.69) base 133 10 (0.01) 85 (0.71) 86 (0.71) 86 (0.78) 88 (0.79) 85 (0.75) 83 (0.76) 89 (0.8) visit 118 4 (0.0) 73 (0.6) 75 (0.61) 76 (0.64) 80 (0.68) 74 (0.64) 75 (0.66) 82 (0.74) attend 92 11 (0.02) 65 (0.58) 64 (0.59) 65 (0.63) 62 (0.6) 66 (0.63) 62 (0.58) 69 (0.64) adviser 56 2 (0.0) 42 (0.56) 47 (0.58) 44 (0.58) 43 (0.59) 45 (0.63) 43 (0.53) 44 (0.63) criticize 40 5 (0.0) 31 (0.66) 33 (0.62) 33 (0.7) 33 (0.67) 33 (0.61) 35 (0.69) 37 (0.69) support 33 3 (0.0) 19 (0.27) 22 (0.28) 18 (0.21) 19 (0.28) 22 (0.27) 23 (0.27) 21 (0.27) praise 5 0 (0.0) 2 (0.0) 2 (0.01) 4 (0.03) 3 (0.01) 3 (0.02) 5 (0.03) 2 (0.01) vote 3 2 (0.01) 3 (0.63) 3 (0.63) 3 (0.32) 3 (0.49) 3 (0.51) 3 (0.59) 3 (0.64) Average MAP100 # 0.04 0.53 0.53 0.51 0.53 0.53 0.55 0.59 Weighted Average MAP100 # 0.08 0.62 0.61 0.63 0.63 0.61 0.65 0.70 0.5 0.6 0.7 NFE CORE CORE+m CORE+t CORE+w CORE+mt CORE+mtw Weighted Average MAP 0.62 0.61 0.63 0.63 0.61 0.65 0.70 EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 18 of 21
  • 22. Anecdotal results author(x,y) ranked list of tuples 1 (Winston Groom, Forrest Gump) 2 (D. M. Thomas, White Hotel) 3 (Roger Rosenblatt, Life Itself) 4 (Edmund White, Skinned Alive) 5 (Peter Manso, Brando: The Biography) similar relations 0.98 “reviews x by y”(x,y) 0.97 “book by”(x,y) 0.95 “author of”(x,y) 0.95 ” ‘s novel”(x,y) 0.95 “ ‘s book”(x,y) similar relations 0.87 “scientist”(x,y) 0.84 “scientist with”(x,y) 0.80 “professor at”(x,y) 0.79 “scientist for”(x,y) 0.78 “neuroscientist at”(x,y) ranked list of tuples 1 (Riordan Roett, Johns Hopkins University) 2 (Dr. R. M. Roberts, University of Missouri) 3 (Linda Mayes, Yale University) 4 (Daniel T. Jones, Cardiff Business School) 5 (Russell Ross, University of Iowa) “scientist at”(x,y) I semantic similarity of relations is one aspect of our model I similar relations treated di↵erently in di↵erent contexts EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 19 of 21
  • 23. Conclusion I CORE, a matrix factorization model for open relation extraction that incorporates contextual information I based on factorization machines and open-world assumption I extensible model, additional contextual information can be integrated when available I experimental study suggests that exploiting context can significantly improve prediction performance I Source code, datasets, and supporting material are available at https://github.com/fabiopetroni/CORE EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 20 of 21
  • 24. Thank you! Questions? Fabio Petroni Sapienza University of Rome, Italy Current position: PhD Student in Engineering in Computer Science Research Interests: data mining, machine learning, big data petroni@dis.uniroma1.it EMNLP 2015. September 17-21, 2015. Lisbon, Portugal. 21 of 21