1. Word Frame Disambiguation:
Evaluating Linguistic Linked
Data on Frame Detection
Mehwish Alam1, Aldo Gangemi1,2, Valentina Presutti2
1LIPN, Paris Nord University, CNRS UMR7030, France
2Semantic Technology Lab, ISTC-CNR, Rome, Italy
7. dataset nodes
blue: role-oriented lexical resources
purple: emotion-oriented lexical resources
red: fact-oriented data
green: wordnet-like lexical resources
yellow: ontology schemas
grey: topic models
dotted line: existing RDF data
continuous line: newly created RDF data
Originally, not many RDF datasets
linked in the word-lexicon-data space
arrows
orange: Framester links
black dotted: previous links
8. dataset nodes
blue: role-oriented lexical resources
purple: emotion-oriented lexical resources
red: fact-oriented data
green: wordnet-like lexical resources
yellow: ontology schemas
grey: topic models
dotted line: existing RDF data
continuous line: newly created RDF data
We added more RDF datasets linked in
the word-lexicon-data space
arrows
orange: Framester links
black dotted: previous links
9. dataset nodes
blue: role-oriented lexical resources
purple: emotion-oriented lexical resources
orange: fact-oriented data
green: wordnet-like lexical resources
yellow: ontology schemas
grey: topic models
dotted line: existing RDF data
continuous line: newly created RDF data
arrows
orange: Framester links
black dotted: previous links
We added many new
links so creating a
new formal resource
in the word-lexicon-
data space
10. Sample triples
• wn30instances:synset-anti-G_suit-noun-1
wn30schema:containsWordSense wn30instances:wordsense-
anti-G_suit-noun-1 , wn30instances:wordsense-G_suit-
noun-1 ; wn30schema:gloss “worn by fliers and
astronauts to counteract the forces of gravity and
acceleration” .
• wn30instances:synset-anti-G_suit-noun-1
own2dul:proxhyp wn30instances:synset-pressure_suit-
noun-1 ; own2dul:hyp wn30instances:synset-
consumer_goods-noun-1 ; own2dul:d0
dul:PhysicalObject .
• wn30instances:synset-anti-G_suit-noun-1 a
fschema:SynsetFrame ; fschema:unaryProjectionOf
frame:Clothing , frame:Artifact , frame:Wearing ,
frame:Dressing .
wn30
own
framester
16. Framester semantics 1/3
• A frame is defined as a multigrade predicate 𝜙(e,x1, ..., xn), where
𝜙 is a first-order relation, e is a (neo-Davidsonian) variable for any
eventuality or state of affairs described by the frame, and xi is a
variable for any argument place. Interpretation of predicates is
made on a domain ∆
I
of
• D&S-style Punning
• 𝜙
I
⊆ dands:Situation
I
• 𝜙 ∈ fschema:Frame
I
(⊆ dands:Description
I
)
• Actual frame occurrences
• s ∈ fschema:Situation
I
, 𝜙
I
17. Framester semantics 2/3
• Projections
• A semantic role is a internal binary projection rol(e,xi)
of a frame 𝜙, so that rol(e,xi) → 𝜙(e,x1, …,xn), i≥1≤n
• A co-participation relation is an external binary
projection cop(xj,xk) of a frame 𝜙, so that cop(xj,xk) →
𝜙(e,x1, …,xn), j≥1≤n , k≥1≤n
• A selectional restriction or semantic type is a unary
projection typ(xm) of a frame 𝜙, so that typ(xm) →
𝜙(e,x1, …,xn), m≥1≤n
18. Framester semantics 3/3
• Individuals and words
• A (non-situational) individual entity ent has a role in a
possible occurrence of a frame 𝜙 when ent ∈ typI
, i.e.
when it is an instance of a type compatible (or coerced) as
a unary projection of 𝜙
• An individual tuple is a possible occurrence of a frame 𝜙
when <x,y> ∈ rolI
, or <x,y> ∈ copI
, i.e. when it is a
instance of a property compatible (or coerced) as a binary
projection of 𝜙
• A word is an evocation of a frame 𝜙 when it can be
disambiguated to a frame or one of its projections
19. Consequences
• WordNet synsets are unary projections of frames (synset-based frames)
• WordNet word senses are unary projections of lexical units (sense-based frames)
• WordNet “tropes” are binary projections of implicit synset-based frames
• VerbNet verb (sub-)classes are frames
• VerbNet verb class members are sense-based frames
• LD properties are binary projections of frames (either internal or external)
• LD classes are either (candidate) frames or unary projections of frames
• LD regular individuals are instances of unary projections of frames (role players in an external
data frame)
• LD qua-individuals (e.g. DBpedia career stations) are instances of unary projections of a
specific frame
• LD assertions are instances of binary projections of (?external) frames
20. Achievements
• more than 40 million triples including new LOD versions of many, linguistic/factual resources,
and links among them, and to Framester
• formal schema interoperability across datasets
• full revision of WordNet-FrameNet mappings
• large extension of frame coverage
• frame annotations for any kind of entity
• full mapping of local (frame-dependent), and global roles from multiple resources
• new semantic role taxonomy from localised roles way up to abstract roles and dependencies
• alignment of frames, roles and types to foundational ontologies
• new frame relations discovered based on mappings and inferences
• Word Frame Disambiguation service
21. Consequent issues
• Many wrong mappings e.g. in FrameBase-WordNet
• Many inaccurate subsumptions and cycles in FrameNet
frame elements because of heterogeneous inheritance/
causal semantics
• Other mixed errors in FrameNet, e.g. when composing
formal assumptions from frame/role taxonomies
• Errors in stand-off WordNet files (specially with
teleological and derivational morpho-semantics datasets)
• …
25. R&D
• Word Frame Disambiguation
• Frame vectors and frame topic models (frame2vec for deep learning)
• OKE extensions (cf. FRED)
• Frame clustering and complex frame discovery
• Sentence frame fingerprinting (valence patterns)
• Automated matching between semantic roles
• Automated matching between roles and LOD properties
• Overlap matching between frames and LOD classes
• Assisted eXtreme Design (ODP semantic search)
• …
(✔)
(✔)
(✔)
✔
✔
26. Conclusions
• A new large resource in LOD, linking linguistic and
factual knowledge with a frame-oriented semantics,
expressible in OWL
• Evaluation wrt frame detection proves increase of
recall and state-of-the-art precision
• A lot of research themes by applying links and
shared semantics: valence patterns, clustering,
embeddings, interoperability
27. Related publications: Framester
and frame semantics
• A. Gangemi, M. Alam, L. Asprino, V. Presutti, D.R.
Recupero. 2016. Framester: A Wide Coverage
Linguistic Linked Data Hub. EKAW
• Aldo Gangemi, 2010. What’s in a Schema?,
Ontology and the Lexicon, Cambridge University
Press
• Charles J Fillmore. 1976. Frame semantics and the
nature of language. Annals of the New York
Academy of Sciences
28. Related publications:
Linguistic resources
• Maddalen Lopez de Lacalle, Egoitz Laparra, and German Rigau. 2014. Predicate Matrix: extending
SemLink through WordNet mappings. LREC
• Antoine Zimmermann, Christophe Gravier, Julien Subercaze, and Quentin Cruzille. 2013. Nell2RDF:
Read the Web, and Turn it into RDF. KNOW@LOD, CEUR
• Montse Cuadros, Llúıs Padró, German Rigau. 2012. Highlighting relevant concepts from topic
signatures. LREC
• Roberto Navigli and Simone Paolo Ponzetto. 2012. BabelNet: The Automatic Construction, Evaluation
and Application of a Wide-Coverage Multi-lingual Semantic Network. Artificial Intelligence
• Andrew Carlson, Justin Betteridge, Bryan Kisiel, Burr Settles, Estevam R Hruschka Jr, and Tom M
Mitchell. 2010. Toward an architecture for never-ending language learning. AAAI
• Martha Palmer. 2009. Semlink: Linking Prop-Bank, VerbNet and FrameNet. GenLex-09
• Karin Kipper Schuler. 2005. Verbnet: A Broad-coverage, Comprehensive Verb Lexicon. Ph.D. thesis
• Christiane Fellbaum, editor. 1998. WordNet: an electronic lexical database, MIT Press
• Collin F. Baker, Charles J. Fillmore, and John B. Lowe. 1998. The Berkeley FrameNet Project.
COLING
29. Related publications: Linked
data resources
• Linguistic linked data resources
• Antoine Zimmermann, Christophe Gravier, Julien Subercaze, and Quentin Cruzille. 2013.
Nell2RDF: Read the Web, and Turn it into RDF. KNOW@LOD
• Andrea Giovanni Nuzzolese, Aldo Gangemi, and Valentina Presutti. 2011. Gathering lexical
linked data and knowledge patterns from FrameNet. KCAP
• Mark Van Assem, Aldo Gangemi, and Guus Schreiber. 2006. Conversion of WordNet to a
standard RDF/OWL representation. LREC
• Aldo Gangemi, Roberto Navigli, and Paola Velardi. 2003. The OntoWordNet project:
Extension and axiomatization of conceptual relations in Wordnet. ODBASE
• Factual inked data resources
• Jens Lehmann, Chris Bizer, Georgi Kobilarov, Sören Auer, Christian Becker, Richard
Cyganiak, and Sebastian Hellmann. 2009. DBpedia - A Crystallization Point for the Web of
Data. Journal of Web Semantics
• Johannes Hoffart, Fabian M Suchanek, Klaus Berberich, and Gerhard Weikum. 2013. Yago2:
A spatially and temporally enhanced knowledge base from wikipedia. Artificial Intelligence
30. Related publications: Tools
• Aldo Gangemi, Valentina Presutti, Diego Reforgiato
Recupero, Andrea Giovanni Nuzzolese, Francesco
Draicchio, and Misael Mongiovi. 2016. Semantic
Web Machine Reading with FRED. Semantic Web
• Dipanjan Das, Desai Chen, André F. T. Martins,
Nathan Schneider, and Noah A. Smith. 2014.
Frame-semantic parsing. Computational Linguistics