SlideShare a Scribd company logo
1 of 56
Download to read offline
A Survey on Open Information Extraction
COLING 2018
Christina Niklaus1, Matthias Cetto1, André Freitas2
and Siegfried Handschuh1
1Natural Language Processing and Semantic Computing
University of Passau
2School of Computer Science
University of Manchester
August 24, 2018
1
Information Extraction
Task of IE: distill semantic relations from NL text
“Barack Obama was born in 1961.”
Barack Obama; was born in; 1961
Traditional IE:
hand-labeled data
pre-defined set of target relations (supervised approach)
small, homogeneous corpora
scalability to large, heterogeneous corpora?
Motivation 2
Information Extraction
Task of IE: distill semantic relations from NL text
“Barack Obama was born in 1961.”
Barack Obama; was born in; 1961
Traditional IE:
hand-labeled data
pre-defined set of target relations (supervised approach)
small, homogeneous corpora
scalability to large, heterogeneous corpora?
Motivation 2
Information Extraction
Task of IE: distill semantic relations from NL text
“Barack Obama was born in 1961.”
Barack Obama; was born in; 1961
Traditional IE:
hand-labeled data
pre-defined set of target relations (supervised approach)
small, homogeneous corpora
scalability to large, heterogeneous corpora?
Motivation 2
Information Extraction
Task of IE: distill semantic relations from NL text
“Barack Obama was born in 1961.”
Barack Obama; was born in; 1961
Traditional IE:
hand-labeled data
pre-defined set of target relations (supervised approach)
small, homogeneous corpora
scalability to large, heterogeneous corpora?
Motivation 2
Information Extraction
Task of IE: distill semantic relations from NL text
“Barack Obama was born in 1961.”
Barack Obama; was born in; 1961
Traditional IE:
hand-labeled data
pre-defined set of target relations (supervised approach)
small, homogeneous corpora
scalability to large, heterogeneous corpora?
Motivation 2
Open Information Extraction
Introduction of a new extraction paradigm: Open IE (Banko
et al., 2007)
Challenges of Open IE systems:
1. automation: automatic discovery of relations (unsupervised
approach)
2. corpus heterogeneity: domain-independent usage
3. efficiency: readily scale to large amounts of text
Motivation 3
Open Information Extraction
Introduction of a new extraction paradigm: Open IE (Banko
et al., 2007)
Challenges of Open IE systems:
1. automation: automatic discovery of relations (unsupervised
approach)
2. corpus heterogeneity: domain-independent usage
3. efficiency: readily scale to large amounts of text
Motivation 3
Early Approaches in Open IE
1. hand-crafted extraction patterns:
A human manually defines a set of extraction rules.
2. self-supervised learning:
The system automatically finds and labels its own training
examples.
Open IE Approaches Early Approaches in Open IE 4
Early Approaches in Open IE
1. hand-crafted extraction patterns:
A human manually defines a set of extraction rules.
2. self-supervised learning:
The system automatically finds and labels its own training
examples.
Open IE Approaches Early Approaches in Open IE 4
Early Approaches in Open IE
1. hand-crafted extraction patterns:
A human manually defines a set of extraction rules.
2. self-supervised learning:
The system automatically finds and labels its own training
examples.
Open IE Approaches Early Approaches in Open IE 4
TextRunner (Banko et al., 2007)
1. self-supervised learner: heuristically identifies and labels a
set of extractions as examples to train a model of relations
using unlexicalized features
2. extractor: makes a single pass over the corpus to extract
tuples for all possible relations
3. redundancy-based assessor: assigns a probability to each
tuple based on the number of sentences from which each
extraction was found
Open IE Approaches Early Approaches in Open IE 5
TextRunner (Banko et al., 2007)
1. self-supervised learner: heuristically identifies and labels a
set of extractions as examples to train a model of relations
using unlexicalized features
2. extractor: makes a single pass over the corpus to extract
tuples for all possible relations
3. redundancy-based assessor: assigns a probability to each
tuple based on the number of sentences from which each
extraction was found
Open IE Approaches Early Approaches in Open IE 5
TextRunner (Banko et al., 2007)
1. self-supervised learner: heuristically identifies and labels a
set of extractions as examples to train a model of relations
using unlexicalized features
2. extractor: makes a single pass over the corpus to extract
tuples for all possible relations
3. redundancy-based assessor: assigns a probability to each
tuple based on the number of sentences from which each
extraction was found
Open IE Approaches Early Approaches in Open IE 5
TextRunner (Banko et al., 2007)
1. self-supervised learner: heuristically identifies and labels a
set of extractions as examples to train a model of relations
using unlexicalized features
2. extractor: makes a single pass over the corpus to extract
tuples for all possible relations
3. redundancy-based assessor: assigns a probability to each
tuple based on the number of sentences from which each
extraction was found
Open IE Approaches Early Approaches in Open IE 5
TextRunner (Banko et al., 2007)
1. self-supervised learner: heuristically identifies and labels a
set of extractions as examples to train a model of relations
using unlexicalized features
2. extractor: makes a single pass over the corpus to extract
tuples for all possible relations
3. redundancy-based assessor: assigns a probability to each
tuple based on the number of sentences from which each
extraction was found
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
Open IE Approaches Early Approaches in Open IE 5
TextRunner (Banko et al., 2007)
1. self-supervised learner: heuristically identifies and labels a
set of extractions as examples to train a model of relations
using unlexicalized features
2. extractor: makes a single pass over the corpus to extract
tuples for all possible relations
3. redundancy-based assessor: assigns a probability to each
tuple based on the number of sentences from which each
extraction was found
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
The novelist Franz Kafka; is; the author of a short story
Open IE Approaches Early Approaches in Open IE 5
Problems with TextRunner
incoherent extractions: relational phrase has no meaningful
interpretation
uninformative extractions: omit critical information
Open IE Approaches Early Approaches in Open IE 6
Problems with TextRunner
incoherent extractions: relational phrase has no meaningful
interpretation
uninformative extractions: omit critical information
Open IE Approaches Early Approaches in Open IE 6
Problems with TextRunner
incoherent extractions: relational phrase has no meaningful
interpretation
uninformative extractions: omit critical information
Open IE Approaches Early Approaches in Open IE 7
Problems with TextRunner
incoherent extractions: relational phrase has no meaningful
interpretation
uninformative extractions: omit critical information
ReVerb (Fader et al., 2011)
Open IE Approaches Early Approaches in Open IE 7
ReVerb (Fader et al., 2011)
find longest phrase matching a simple syntactic constraint
to avoid overspecified relational phrases, a lexical constraint
is introduced: |args(rel)| > k
Open IE Approaches Early Approaches in Open IE 8
ReVerb (Fader et al., 2011)
find longest phrase matching a simple syntactic constraint
to avoid overspecified relational phrases, a lexical constraint
is introduced: |args(rel)| > k
Open IE Approaches Early Approaches in Open IE 8
ReVerb (Fader et al., 2011)
find longest phrase matching a simple syntactic constraint
to avoid overspecified relational phrases, a lexical constraint
is introduced: |args(rel)| > k
Open IE Approaches Early Approaches in Open IE 8
ReVerb (Fader et al., 2011)
find longest phrase matching a simple syntactic constraint
to avoid overspecified relational phrases, a lexical constraint
is introduced: |args(rel)| > k
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
Open IE Approaches Early Approaches in Open IE 8
ReVerb (Fader et al., 2011)
find longest phrase matching a simple syntactic constraint
to avoid overspecified relational phrases, a lexical constraint
is introduced: |args(rel)| > k
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
The novelist Franz Kafka; is the author of; a short story
Open IE Approaches Early Approaches in Open IE 8
Problem with ReVerb
limited to relations that are mediated by verbs
Open IE Approaches Early Approaches in Open IE 9
Problem with ReVerb
limited to relations that are mediated by verbs
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
Open IE Approaches Early Approaches in Open IE 9
Problem with ReVerb
limited to relations that are mediated by verbs
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
Open IE Approaches Early Approaches in Open IE 9
Problem with ReVerb
limited to relations that are mediated by verbs
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
Open IE Approaches Early Approaches in Open IE 9
Problem with ReVerb
limited to relations that are mediated by verbs
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
identification of relationships me-
diated by nouns and adjectives:
OLLIE (Mausam et al., 2012)
Open IE Approaches Early Approaches in Open IE 9
OLLIE (Mausam et al., 2012)
applies a set of high precision seed tuples from ReVerb
to bootstrap a large training set
over which it learns a set of extraction pattern templates
using dependency parses
Open IE Approaches Early Approaches in Open IE 10
OLLIE (Mausam et al., 2012)
sample open pattern templates:
applied to individual sentences at extraction time
Open IE Approaches Early Approaches in Open IE 11
OLLIE (Mausam et al., 2012)
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
Open IE Approaches Early Approaches in Open IE 12
OLLIE (Mausam et al., 2012)
Example
The novelist Franz Kafka is the author of a short story entitled
”The Metamorphosis”.
Franz Kafka; is the author of; a short story
Franz Kafka; is; a novelist
a short story; be entitled; ”The Metamorphosis”
Open IE Approaches Early Approaches in Open IE 12
Problem with OLLIE (and previous approaches)
limited to the extraction of binary relations
Open IE Approaches Early Approaches in Open IE 13
Problem with OLLIE (and previous approaches)
limited to the extraction of binary relations
Example
Franz Kafka was born in Prague in 1883.
Open IE Approaches Early Approaches in Open IE 13
Problem with OLLIE (and previous approaches)
limited to the extraction of binary relations
Example
Franz Kafka was born into a Jewish family in Prague in 1883.
ReVerb: Franz Kafka; was born into; a Jewish family
Open IE Approaches Early Approaches in Open IE 13
Problem with OLLIE (and previous approaches)
limited to the extraction of binary relations
Example
Franz Kafka was born into a Jewish family in Prague in 1883 .
ReVerb: Franz Kafka; was born into; a Jewish family
Open IE Approaches Early Approaches in Open IE 13
Problem with OLLIE (and previous approaches)
limited to the extraction of binary relations
Example
Franz Kafka was born into a Jewish family in Prague in 1883 .
ReVerb: Franz Kafka; was born into; a Jewish family
capture complete facts from sentences
by gathering the full set of arguments for
each relational phrase (n-ary relations):
KrakeN (Akbik and Löser, 2012)
Exemplar (Mesquita et al., 2013)
Open IE Approaches Early Approaches in Open IE 13
KrakeN and Exemplar
hand-crafted extraction rules over dependency parses
Open IE Approaches Early Approaches in Open IE 14
KrakeN and Exemplar
hand-crafted extraction rules over dependency parses
Example
Franz Kafka was born into a Jewish family in Prague in 1883 .
Franz Kafka; was born; (into) a Jewish family; (in) Prague; (in)
1883
Open IE Approaches Early Approaches in Open IE 14
Paraphrase-based Approaches
Previous approaches often produce erroneous extractions on
syntactically complex sentences
Open IE Approaches Paraphrase-based Approaches 15
Paraphrase-based Approaches
Previous approaches often produce erroneous extractions on
syntactically complex sentences
generation of an intermediate rep-
resentation using a sentence re-
structuring stage:
ClausIE (Del Corro and
Gemulla, 2013)
Schmidek and Barbosa (2014)
Stanford Open IE (Angeli et al.,
2015)
Open IE Approaches Paraphrase-based Approaches 15
Problem with Paraphrase-based Approaches
lack the expressiveness needed for a proper interpretation of
complex assertions taking into account the context under which a
proposition is complete and correct
Open IE Approaches Paraphrase-based Approaches 16
Problem with Paraphrase-based Approaches
lack the expressiveness needed for a proper interpretation of
complex assertions taking into account the context under which a
proposition is complete and correct
systems that capture inter-proposition
relationships
Open IE Approaches Paraphrase-based Approaches 16
OLLIE and its Successors Open IE 4 and 5
OLLIE: extra field to distinguish between information
asserted in a sentence and information that is only
hypothetical or conditionally true
Open IE 4 and 5: mark up temporal and local context
Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
OLLIE and its Successors Open IE 4 and 5
OLLIE: extra field to distinguish between information
asserted in a sentence and information that is only
hypothetical or conditionally true
Example
Romney will be elected President if he wins five key states.
( Romney; will be elected; President ;
ClausalModifier if; he wins five key states)
Open IE 4 and 5: mark up temporal and local context
Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
OLLIE and its Successors Open IE 4 and 5
OLLIE: extra field to distinguish between information
asserted in a sentence and information that is only
hypothetical or conditionally true
Open IE 4 and 5: mark up temporal and local context
Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
OLLIE and its Successors Open IE 4 and 5
OLLIE: extra field to distinguish between information
asserted in a sentence and information that is only
hypothetical or conditionally true
Open IE 4 and 5: mark up temporal and local context
Example
Franz Kafka was born into a Jewish family in Prague
in 1883 .
Franz Kafka; was born; into a Jewish family; L: in Prague;
T: in 1883
Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
Graphene (Cetto et al., 2018)
Lightweight semantic representation:
two-layered hierarchy of core relational tuples and
accompanying contextual information that are
semantically linked via rhetorical relations
(3)
rhetorical rela-
tion identification
(2)
constituency
type clas-
sification
(1)
syntactic
sentence
simplification
Open IE Approaches Approaches Capturing Inter-Proposition Relationships 18
Graphene (Cetto et al., 2018)
“Although the Treasury will announce details of the November
refunding on Monday, the funding will be delayed if Congress and
President Bush fail to increase the Treasury’s borrowing capacity.”
Open IE Approaches Approaches Capturing Inter-Proposition Relationships 19
Graphene (Cetto et al., 2018)
“Although the Treasury will announce details of the November
refunding on Monday, the funding will be delayed if Congress and
President Bush fail to increase the Treasury’s borrowing capacity.”
Open IE Approaches Approaches Capturing Inter-Proposition Relationships 19
Problems in the Evaluation of Open IE Systems
no clear formal specification of what constitutes a valid
relational tuple
no established, large-scale annotated corpus serving as a gold
standard dataset
Evaluation 20
Problems in the Evaluation of Open IE Systems
no clear formal specification of what constitutes a valid
relational tuple
no established, large-scale annotated corpus serving as a gold
standard dataset
scalability to large amounts of text?
portability to various genres of text?
objective and reproducible
cross-system comparison?
Evaluation 20
Open Research Questions
large-scale gold standard evaluation
dataset allowing for an objective and
reproducible cross-system comparison
applicability and transferability of the
proposed Open IE approaches to
languages other than English
canonicalization of relational phrases
and arguments
Open Research Questions 21

More Related Content

Similar to Survey on Open IE

An-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-ProcessingAn-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-ProcessingTheodore J. LaGrow
 
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...ijnlc
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
NUIG Research Showcase 2014
NUIG Research Showcase 2014NUIG Research Showcase 2014
NUIG Research Showcase 2014Bianca Pereira
 
EXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEBEXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEBijcsit
 
HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...
HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...
HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...dannyijwest
 
Hyponymy extraction of domain ontology
Hyponymy extraction of domain ontologyHyponymy extraction of domain ontology
Hyponymy extraction of domain ontologyIJwest
 
Capturing and leveraging materials science knowledge from millions of journal...
Capturing and leveraging materials science knowledge from millions of journal...Capturing and leveraging materials science knowledge from millions of journal...
Capturing and leveraging materials science knowledge from millions of journal...Anubhav Jain
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Anubhav Jain
 
Lecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented ApplicationsLecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented ApplicationsMarina Santini
 
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...ijnlc
 
601-CriticalEssay-2-Portfolio Edition
601-CriticalEssay-2-Portfolio Edition601-CriticalEssay-2-Portfolio Edition
601-CriticalEssay-2-Portfolio EditionJordan Chapman
 
Unsupervised Learning of a Social Network from a Multiple-Source News Corpus
Unsupervised Learning of a Social Network from a Multiple-Source News CorpusUnsupervised Learning of a Social Network from a Multiple-Source News Corpus
Unsupervised Learning of a Social Network from a Multiple-Source News Corpushtanev
 

Similar to Survey on Open IE (20)

An-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-ProcessingAn-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-Processing
 
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...A N H YBRID  A PPROACH TO  W ORD  S ENSE  D ISAMBIGUATION  W ITH  A ND  W ITH...
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
NUIG Research Showcase 2014
NUIG Research Showcase 2014NUIG Research Showcase 2014
NUIG Research Showcase 2014
 
Karlgren
KarlgrenKarlgren
Karlgren
 
EXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEBEXTRACTING ARABIC RELATIONS FROM THE WEB
EXTRACTING ARABIC RELATIONS FROM THE WEB
 
HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...
HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...
HYPONYMY EXTRACTION OF DOMAIN ONTOLOGY CONCEPT BASED ON CCRFS AND HIERARCHY C...
 
Hyponymy extraction of domain ontology
Hyponymy extraction of domain ontologyHyponymy extraction of domain ontology
Hyponymy extraction of domain ontology
 
Capturing and leveraging materials science knowledge from millions of journal...
Capturing and leveraging materials science knowledge from millions of journal...Capturing and leveraging materials science knowledge from millions of journal...
Capturing and leveraging materials science knowledge from millions of journal...
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...
 
Lecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented ApplicationsLecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented Applications
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Using lexical chains for text summarization
Using lexical chains for text summarizationUsing lexical chains for text summarization
Using lexical chains for text summarization
 
Meghyn slides-hse-2014
Meghyn slides-hse-2014Meghyn slides-hse-2014
Meghyn slides-hse-2014
 
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
 
NLP todo
NLP todoNLP todo
NLP todo
 
FinalReport
FinalReportFinalReport
FinalReport
 
601-CriticalEssay-2-Portfolio Edition
601-CriticalEssay-2-Portfolio Edition601-CriticalEssay-2-Portfolio Edition
601-CriticalEssay-2-Portfolio Edition
 
Study_Report
Study_ReportStudy_Report
Study_Report
 
Unsupervised Learning of a Social Network from a Multiple-Source News Corpus
Unsupervised Learning of a Social Network from a Multiple-Source News CorpusUnsupervised Learning of a Social Network from a Multiple-Source News Corpus
Unsupervised Learning of a Social Network from a Multiple-Source News Corpus
 

Recently uploaded

A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 

Recently uploaded (20)

A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorption
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 

Survey on Open IE

  • 1. A Survey on Open Information Extraction COLING 2018 Christina Niklaus1, Matthias Cetto1, André Freitas2 and Siegfried Handschuh1 1Natural Language Processing and Semantic Computing University of Passau 2School of Computer Science University of Manchester August 24, 2018 1
  • 2. Information Extraction Task of IE: distill semantic relations from NL text “Barack Obama was born in 1961.” Barack Obama; was born in; 1961 Traditional IE: hand-labeled data pre-defined set of target relations (supervised approach) small, homogeneous corpora scalability to large, heterogeneous corpora? Motivation 2
  • 3. Information Extraction Task of IE: distill semantic relations from NL text “Barack Obama was born in 1961.” Barack Obama; was born in; 1961 Traditional IE: hand-labeled data pre-defined set of target relations (supervised approach) small, homogeneous corpora scalability to large, heterogeneous corpora? Motivation 2
  • 4. Information Extraction Task of IE: distill semantic relations from NL text “Barack Obama was born in 1961.” Barack Obama; was born in; 1961 Traditional IE: hand-labeled data pre-defined set of target relations (supervised approach) small, homogeneous corpora scalability to large, heterogeneous corpora? Motivation 2
  • 5. Information Extraction Task of IE: distill semantic relations from NL text “Barack Obama was born in 1961.” Barack Obama; was born in; 1961 Traditional IE: hand-labeled data pre-defined set of target relations (supervised approach) small, homogeneous corpora scalability to large, heterogeneous corpora? Motivation 2
  • 6. Information Extraction Task of IE: distill semantic relations from NL text “Barack Obama was born in 1961.” Barack Obama; was born in; 1961 Traditional IE: hand-labeled data pre-defined set of target relations (supervised approach) small, homogeneous corpora scalability to large, heterogeneous corpora? Motivation 2
  • 7. Open Information Extraction Introduction of a new extraction paradigm: Open IE (Banko et al., 2007) Challenges of Open IE systems: 1. automation: automatic discovery of relations (unsupervised approach) 2. corpus heterogeneity: domain-independent usage 3. efficiency: readily scale to large amounts of text Motivation 3
  • 8. Open Information Extraction Introduction of a new extraction paradigm: Open IE (Banko et al., 2007) Challenges of Open IE systems: 1. automation: automatic discovery of relations (unsupervised approach) 2. corpus heterogeneity: domain-independent usage 3. efficiency: readily scale to large amounts of text Motivation 3
  • 9. Early Approaches in Open IE 1. hand-crafted extraction patterns: A human manually defines a set of extraction rules. 2. self-supervised learning: The system automatically finds and labels its own training examples. Open IE Approaches Early Approaches in Open IE 4
  • 10. Early Approaches in Open IE 1. hand-crafted extraction patterns: A human manually defines a set of extraction rules. 2. self-supervised learning: The system automatically finds and labels its own training examples. Open IE Approaches Early Approaches in Open IE 4
  • 11. Early Approaches in Open IE 1. hand-crafted extraction patterns: A human manually defines a set of extraction rules. 2. self-supervised learning: The system automatically finds and labels its own training examples. Open IE Approaches Early Approaches in Open IE 4
  • 12. TextRunner (Banko et al., 2007) 1. self-supervised learner: heuristically identifies and labels a set of extractions as examples to train a model of relations using unlexicalized features 2. extractor: makes a single pass over the corpus to extract tuples for all possible relations 3. redundancy-based assessor: assigns a probability to each tuple based on the number of sentences from which each extraction was found Open IE Approaches Early Approaches in Open IE 5
  • 13. TextRunner (Banko et al., 2007) 1. self-supervised learner: heuristically identifies and labels a set of extractions as examples to train a model of relations using unlexicalized features 2. extractor: makes a single pass over the corpus to extract tuples for all possible relations 3. redundancy-based assessor: assigns a probability to each tuple based on the number of sentences from which each extraction was found Open IE Approaches Early Approaches in Open IE 5
  • 14. TextRunner (Banko et al., 2007) 1. self-supervised learner: heuristically identifies and labels a set of extractions as examples to train a model of relations using unlexicalized features 2. extractor: makes a single pass over the corpus to extract tuples for all possible relations 3. redundancy-based assessor: assigns a probability to each tuple based on the number of sentences from which each extraction was found Open IE Approaches Early Approaches in Open IE 5
  • 15. TextRunner (Banko et al., 2007) 1. self-supervised learner: heuristically identifies and labels a set of extractions as examples to train a model of relations using unlexicalized features 2. extractor: makes a single pass over the corpus to extract tuples for all possible relations 3. redundancy-based assessor: assigns a probability to each tuple based on the number of sentences from which each extraction was found Open IE Approaches Early Approaches in Open IE 5
  • 16. TextRunner (Banko et al., 2007) 1. self-supervised learner: heuristically identifies and labels a set of extractions as examples to train a model of relations using unlexicalized features 2. extractor: makes a single pass over the corpus to extract tuples for all possible relations 3. redundancy-based assessor: assigns a probability to each tuple based on the number of sentences from which each extraction was found Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. Open IE Approaches Early Approaches in Open IE 5
  • 17. TextRunner (Banko et al., 2007) 1. self-supervised learner: heuristically identifies and labels a set of extractions as examples to train a model of relations using unlexicalized features 2. extractor: makes a single pass over the corpus to extract tuples for all possible relations 3. redundancy-based assessor: assigns a probability to each tuple based on the number of sentences from which each extraction was found Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. The novelist Franz Kafka; is; the author of a short story Open IE Approaches Early Approaches in Open IE 5
  • 18. Problems with TextRunner incoherent extractions: relational phrase has no meaningful interpretation uninformative extractions: omit critical information Open IE Approaches Early Approaches in Open IE 6
  • 19. Problems with TextRunner incoherent extractions: relational phrase has no meaningful interpretation uninformative extractions: omit critical information Open IE Approaches Early Approaches in Open IE 6
  • 20. Problems with TextRunner incoherent extractions: relational phrase has no meaningful interpretation uninformative extractions: omit critical information Open IE Approaches Early Approaches in Open IE 7
  • 21. Problems with TextRunner incoherent extractions: relational phrase has no meaningful interpretation uninformative extractions: omit critical information ReVerb (Fader et al., 2011) Open IE Approaches Early Approaches in Open IE 7
  • 22. ReVerb (Fader et al., 2011) find longest phrase matching a simple syntactic constraint to avoid overspecified relational phrases, a lexical constraint is introduced: |args(rel)| > k Open IE Approaches Early Approaches in Open IE 8
  • 23. ReVerb (Fader et al., 2011) find longest phrase matching a simple syntactic constraint to avoid overspecified relational phrases, a lexical constraint is introduced: |args(rel)| > k Open IE Approaches Early Approaches in Open IE 8
  • 24. ReVerb (Fader et al., 2011) find longest phrase matching a simple syntactic constraint to avoid overspecified relational phrases, a lexical constraint is introduced: |args(rel)| > k Open IE Approaches Early Approaches in Open IE 8
  • 25. ReVerb (Fader et al., 2011) find longest phrase matching a simple syntactic constraint to avoid overspecified relational phrases, a lexical constraint is introduced: |args(rel)| > k Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. Open IE Approaches Early Approaches in Open IE 8
  • 26. ReVerb (Fader et al., 2011) find longest phrase matching a simple syntactic constraint to avoid overspecified relational phrases, a lexical constraint is introduced: |args(rel)| > k Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. The novelist Franz Kafka; is the author of; a short story Open IE Approaches Early Approaches in Open IE 8
  • 27. Problem with ReVerb limited to relations that are mediated by verbs Open IE Approaches Early Approaches in Open IE 9
  • 28. Problem with ReVerb limited to relations that are mediated by verbs Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. Open IE Approaches Early Approaches in Open IE 9
  • 29. Problem with ReVerb limited to relations that are mediated by verbs Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. Open IE Approaches Early Approaches in Open IE 9
  • 30. Problem with ReVerb limited to relations that are mediated by verbs Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. Open IE Approaches Early Approaches in Open IE 9
  • 31. Problem with ReVerb limited to relations that are mediated by verbs Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. identification of relationships me- diated by nouns and adjectives: OLLIE (Mausam et al., 2012) Open IE Approaches Early Approaches in Open IE 9
  • 32. OLLIE (Mausam et al., 2012) applies a set of high precision seed tuples from ReVerb to bootstrap a large training set over which it learns a set of extraction pattern templates using dependency parses Open IE Approaches Early Approaches in Open IE 10
  • 33. OLLIE (Mausam et al., 2012) sample open pattern templates: applied to individual sentences at extraction time Open IE Approaches Early Approaches in Open IE 11
  • 34. OLLIE (Mausam et al., 2012) Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. Open IE Approaches Early Approaches in Open IE 12
  • 35. OLLIE (Mausam et al., 2012) Example The novelist Franz Kafka is the author of a short story entitled ”The Metamorphosis”. Franz Kafka; is the author of; a short story Franz Kafka; is; a novelist a short story; be entitled; ”The Metamorphosis” Open IE Approaches Early Approaches in Open IE 12
  • 36. Problem with OLLIE (and previous approaches) limited to the extraction of binary relations Open IE Approaches Early Approaches in Open IE 13
  • 37. Problem with OLLIE (and previous approaches) limited to the extraction of binary relations Example Franz Kafka was born in Prague in 1883. Open IE Approaches Early Approaches in Open IE 13
  • 38. Problem with OLLIE (and previous approaches) limited to the extraction of binary relations Example Franz Kafka was born into a Jewish family in Prague in 1883. ReVerb: Franz Kafka; was born into; a Jewish family Open IE Approaches Early Approaches in Open IE 13
  • 39. Problem with OLLIE (and previous approaches) limited to the extraction of binary relations Example Franz Kafka was born into a Jewish family in Prague in 1883 . ReVerb: Franz Kafka; was born into; a Jewish family Open IE Approaches Early Approaches in Open IE 13
  • 40. Problem with OLLIE (and previous approaches) limited to the extraction of binary relations Example Franz Kafka was born into a Jewish family in Prague in 1883 . ReVerb: Franz Kafka; was born into; a Jewish family capture complete facts from sentences by gathering the full set of arguments for each relational phrase (n-ary relations): KrakeN (Akbik and Löser, 2012) Exemplar (Mesquita et al., 2013) Open IE Approaches Early Approaches in Open IE 13
  • 41. KrakeN and Exemplar hand-crafted extraction rules over dependency parses Open IE Approaches Early Approaches in Open IE 14
  • 42. KrakeN and Exemplar hand-crafted extraction rules over dependency parses Example Franz Kafka was born into a Jewish family in Prague in 1883 . Franz Kafka; was born; (into) a Jewish family; (in) Prague; (in) 1883 Open IE Approaches Early Approaches in Open IE 14
  • 43. Paraphrase-based Approaches Previous approaches often produce erroneous extractions on syntactically complex sentences Open IE Approaches Paraphrase-based Approaches 15
  • 44. Paraphrase-based Approaches Previous approaches often produce erroneous extractions on syntactically complex sentences generation of an intermediate rep- resentation using a sentence re- structuring stage: ClausIE (Del Corro and Gemulla, 2013) Schmidek and Barbosa (2014) Stanford Open IE (Angeli et al., 2015) Open IE Approaches Paraphrase-based Approaches 15
  • 45. Problem with Paraphrase-based Approaches lack the expressiveness needed for a proper interpretation of complex assertions taking into account the context under which a proposition is complete and correct Open IE Approaches Paraphrase-based Approaches 16
  • 46. Problem with Paraphrase-based Approaches lack the expressiveness needed for a proper interpretation of complex assertions taking into account the context under which a proposition is complete and correct systems that capture inter-proposition relationships Open IE Approaches Paraphrase-based Approaches 16
  • 47. OLLIE and its Successors Open IE 4 and 5 OLLIE: extra field to distinguish between information asserted in a sentence and information that is only hypothetical or conditionally true Open IE 4 and 5: mark up temporal and local context Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
  • 48. OLLIE and its Successors Open IE 4 and 5 OLLIE: extra field to distinguish between information asserted in a sentence and information that is only hypothetical or conditionally true Example Romney will be elected President if he wins five key states. ( Romney; will be elected; President ; ClausalModifier if; he wins five key states) Open IE 4 and 5: mark up temporal and local context Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
  • 49. OLLIE and its Successors Open IE 4 and 5 OLLIE: extra field to distinguish between information asserted in a sentence and information that is only hypothetical or conditionally true Open IE 4 and 5: mark up temporal and local context Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
  • 50. OLLIE and its Successors Open IE 4 and 5 OLLIE: extra field to distinguish between information asserted in a sentence and information that is only hypothetical or conditionally true Open IE 4 and 5: mark up temporal and local context Example Franz Kafka was born into a Jewish family in Prague in 1883 . Franz Kafka; was born; into a Jewish family; L: in Prague; T: in 1883 Open IE Approaches Approaches Capturing Inter-Proposition Relationships 17
  • 51. Graphene (Cetto et al., 2018) Lightweight semantic representation: two-layered hierarchy of core relational tuples and accompanying contextual information that are semantically linked via rhetorical relations (3) rhetorical rela- tion identification (2) constituency type clas- sification (1) syntactic sentence simplification Open IE Approaches Approaches Capturing Inter-Proposition Relationships 18
  • 52. Graphene (Cetto et al., 2018) “Although the Treasury will announce details of the November refunding on Monday, the funding will be delayed if Congress and President Bush fail to increase the Treasury’s borrowing capacity.” Open IE Approaches Approaches Capturing Inter-Proposition Relationships 19
  • 53. Graphene (Cetto et al., 2018) “Although the Treasury will announce details of the November refunding on Monday, the funding will be delayed if Congress and President Bush fail to increase the Treasury’s borrowing capacity.” Open IE Approaches Approaches Capturing Inter-Proposition Relationships 19
  • 54. Problems in the Evaluation of Open IE Systems no clear formal specification of what constitutes a valid relational tuple no established, large-scale annotated corpus serving as a gold standard dataset Evaluation 20
  • 55. Problems in the Evaluation of Open IE Systems no clear formal specification of what constitutes a valid relational tuple no established, large-scale annotated corpus serving as a gold standard dataset scalability to large amounts of text? portability to various genres of text? objective and reproducible cross-system comparison? Evaluation 20
  • 56. Open Research Questions large-scale gold standard evaluation dataset allowing for an objective and reproducible cross-system comparison applicability and transferability of the proposed Open IE approaches to languages other than English canonicalization of relational phrases and arguments Open Research Questions 21