Authors: Isabel Segura Bedmar, Doaa Samy, José L. Martínez Fernández, Paloma Martínez
SemEval 2007, 4th International Workshop on Semantic Evaluations, (2007)
Detecting Semantic Relations between Nominals using Support Vector Machines and Linguistic-Based Rules
1. Detecting Semantic Relations between Nominals using Support Vector Machines and Linguistic-Based Rules
Isabel Segura Bedmar, Doaa Samy, José L. Martínez Fernández, Paloma Martínez
Universidad Carlos III de Madrid
2. Agenda
Motivation
Task 4 of SEMEVAL 2007: Classification of Semantic Relations Between Nominals
The Automatic System
The Set of Linguistic Rules
Results
Conclusions
Future Work
3. Motivation
Ontology development to support the Semantic Web has led to the improvement of techniques capable of managing the semantic relations underlying concepts.
Detecting semantic relations is a key issue in automating the ontology learning process.
In Question Answering systems, detecting semantic relations could help to find the exact answers for questions such as:
What are the chemical components of aspirin? (Part-Whole)
List all causes of asthma (Cause-Effect)
List all countries producing olive oil (Product-Producer)
4. SEMEVAL 2007 (ACL 2007) Task: Classification of Semantic Relations between Nominals
Seven semantic relations:
Cause-Effect (e.g., virus-flu)
Instrument-User (e.g., laser-print)
Product-Producer (e.g., honey-bee)
Origin-Entity (e.g., rye-whiskey)
Purpose-Tool (e.g., soup-pot)
Part-Whole (e.g., wheel-car)
Content-Container (e.g., apple-basket)
For the Part-Whole relation, our automatic system ranked first in this SEMEVAL task.
5. Training and Testing Data
Put tea in a heat-resistant jug
Put <e1>tea</e1> in a <e2>heat-resistant jug</e2>
Query = "* in a *"
WordNet(e1) = "tea%1:13:00::"
WordNet(e2) = "jug%1:06:00::"
Training sentences: Content-Container(e1, e2) = true
Testing sentences: Content-Container(e1, e2) = ?
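Extracting the two nominals from the task's annotated format is a simple first step; a minimal sketch in Python (the helper name is ours, not part of the task definition, and the real system was built on GATE):

```python
import re

def parse_annotated_sentence(sentence):
    """Extract the two tagged nominals from a SemEval-style sentence.

    The <e1>/<e2> markup follows the task's annotation format.
    """
    e1 = re.search(r"<e1>(.*?)</e1>", sentence).group(1)
    e2 = re.search(r"<e2>(.*?)</e2>", sentence).group(1)
    return e1, e2

e1, e2 = parse_annotated_sentence(
    "Put <e1>tea</e1> in a <e2>heat-resistant jug</e2>")
# e1 == "tea", e2 == "heat-resistant jug"
```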
7. Features
The set of features used for the classification of semantic relations includes
information from different levels:
Lexical Features
Nominals (blue room) and their heads (room)
The two word tokens before and after the nominals, and the list of words between both nominals
Information regarding the preposition and verb
Query
POS Features
POS tags of tokens in the lexical features
Path: concatenated POS tags of the word list between the nominals
Semantic Features
Lexical File numbers: act, animal, person, location, etc
Synset number
WordNet vector: select the synsets at the third level of depth. If a synset is an ancestor of the nominal, it is assigned the value 1; otherwise it is assigned the value 0.
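The WordNet-vector feature can be sketched as a binary indicator over the third-level synsets. A hedged toy illustration (the ancestor sets here are hand-coded stand-ins for real WordNet hypernym closures, not actual WordNet output):

```python
def wordnet_vector(ancestors, level3_synsets):
    """Binary vector: 1 if the level-3 synset is an ancestor of the
    nominal's synset, else 0 (as described on the Features slide)."""
    return [1 if s in ancestors else 0 for s in level3_synsets]

# Toy data standing in for real WordNet hypernym closures.
level3 = ["artifact.n.01", "living_thing.n.01", "substance.n.01"]
tea_ancestors = {"substance.n.01", "matter.n.03"}  # illustrative only
print(wordnet_vector(tea_ancestors, level3))  # [0, 0, 1]
```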
8. Rules
The rules do not aim at total coverage of the whole dataset, but at improving the results of the learning algorithm in cases where it is obvious whether the relation is true or false according to simple semantic and knowledge constraints.
The rules are based on:
WordNet Information
POS
Lexical
9. Classification of Rules
Semantic rules
Lexico-semantic rules
Morphosyntactic rules
Complex rules combining two or more of the above-mentioned criteria.
10. Example of a Rule for the Cause-Effect Relation
IF (Cause = TIME OR
Cause = LOCATION OR
Cause = PERSON OR
Cause = ANIMAL OR
Cause = GROUP OR
Cause = COMMUNICATIVE_ACT)
THEN Relation(Cause-
Effect)=False
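The rule above can be sketched as a small filter that either rejects the relation or abstains, leaving the decision to the classifier. A hedged Python illustration (the original system was GATE-based; the category names stand in for WordNet lexical file labels):

```python
# Lexical-file categories that cannot plausibly act as a cause,
# per the slide's rule.
NON_CAUSE_CATEGORIES = {"TIME", "LOCATION", "PERSON", "ANIMAL",
                        "GROUP", "COMMUNICATIVE_ACT"}

def cause_effect_rule(cause_category):
    """Return False when the rule fires (relation rejected),
    None when the rule abstains and the SVM decision stands."""
    if cause_category in NON_CAUSE_CATEGORIES:
        return False
    return None

print(cause_effect_rule("TIME"))   # False
print(cause_effect_rule("EVENT"))  # None
```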
11. Example of a Rule for the Product-Producer Relation
Pattern: <PRODUCER> verb <PRODUCT>
IF (verb (active voice) =
{make| produce| manufacture|fabricate}
{cultivate| grow| farm for}
{develop| compose|prepare}
{create| procreate}
{come from}
{provide}
{emit}
AND
PRODUCT!=STATE AND PRODUCT!=ACTIVITY AND
PRODUCT!=TIME
Then Relation(Product-Producer)=TRUE
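This positive rule combines the verb pattern with semantic constraints on the product; a minimal Python sketch of the same logic (illustrative only, not the paper's implementation — the verb set below is taken from the slide):

```python
PRODUCTION_VERBS = {"make", "produce", "manufacture", "fabricate",
                    "cultivate", "grow", "develop", "compose", "prepare",
                    "create", "procreate", "provide", "emit"}
NON_PRODUCT_CATEGORIES = {"STATE", "ACTIVITY", "TIME"}

def product_producer_rule(verb_lemma, voice, product_category):
    """True when the <PRODUCER> verb <PRODUCT> pattern matches with an
    active-voice production verb and an admissible product category;
    None when the rule abstains."""
    if (voice == "active"
            and verb_lemma in PRODUCTION_VERBS
            and product_category not in NON_PRODUCT_CATEGORIES):
        return True
    return None

print(product_producer_rule("produce", "active", "SUBSTANCE"))  # True
print(product_producer_rule("produce", "active", "TIME"))       # None
```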
14. Conclusion
Results obtained demonstrate that applying a hybrid approach to a supervised system achieves good results in a semantic classification task between nominals.
15. Future Work
Wider use of semantic information, such as the hypernyms of each synset or information regarding entity features.
Introducing more syntactic information through syntactic chunking could significantly affect the results.
In the rule-based module, we consider increasing the set of rules for broader coverage.
At present, we are applying this approach in the biomedical domain to detect relations between proteins, genes, drugs, etc.
I’m talking about the improvement of an automatic system for detecting semantic relations between nominals through the use of linguistically motivated knowledge combined with machine learning techniques.
I apologize, as I am not confident in my English. I may not understand you when you ask questions, so please be patient with me.
I’m talking about a hybrid supervised system that combines SVM and a set of linguistic rules for detecting semantic relations between nominals.
Firstly, I’m talking about our motivation for detecting semantic relations.
Secondly, I’m describing the task Classification of Semantic Relations Between Nominals. This task was proposed by the International Workshop on Semantic Evaluation, SEMEVAL. And I’m showing some examples of training and testing data for this task.
After that, I’m describing the automatic system that we developed for this task, and I’m talking about the main features used for the classification.
Then, I’m explaining the linguistic rules that we used to improve the performance of the automatic system.
And finally, I’m showing the improvements, some conclusions, and future work.
We have two motivations: firstly, we think that detecting semantic relations is a crucial part of automatic ontology learning.
In addition, we think that semantic relations could help to answer complex questions such as: What are the chemical components of aspirin?
The semantic classification proposed by this system can be integrated into ontology building or into a Question Answering system. For QA systems, answering complex questions requires a deeper understanding of the question. For example, to answer questions such as “list all causes of asthma” (Cause-Effect), “list all countries producing olive oil” (Product-Producer), or “what are the chemical components of aspirin?” (Part-Whole), detecting these types of semantic relations could help to find the exact answer.
The International Workshop on Semantic Evaluations, SEMEVAL , proposed the evaluation task Classification of Semantic Relations between nominals.
SEMEVAL provided training and testing data for this task. The dataset included seven semantic relations: Cause-Effect, Instrument-User, Product-Producer, Origin-Entity, Purpose-Tool, Part-Whole, and Content-Container.
We participated with a previous version using a support vector machines classifier, and our system was the best for the Part-Whole relation.
SEMEVAL 2007 created a benchmark dataset and evaluation task, Classification of Semantic Relations between Nominals, enabling researchers to compare their algorithms.
In this task, a nominal is a noun, excluding named entities, or a base noun phrase. A base noun phrase is a noun and its premodifiers (nouns, adjectives, determiners).
Each sentence was manually annotated including the following information:
(1) the nominal boundaries
(2) the query used to search for the sentence on the web (e.g., “* in a *”)
(3) Wordnet sense key for each nominal
(4) and, in the case of training sentences, the result of the relation: true or false. In this example, the result is true.
For each relation, 140 training sentences, 70 testing sentences and a precise definition of the relation were provided by the organizers.
We decided to use Support Vector Machines (SVM), one of the most successful machine learning techniques, achieving the best performance in many classification tasks.
Prior to the classification of semantic relations, characteristics of each sentence are automatically extracted using GATE. GATE is an infrastructure for developing and deploying software components for Language Engineering. We used the following GATE components: English Tokenizer, Part-Of-Speech (POS) tagger and Morphological analyser.
The set of features used for the classification of semantic relations includes information from different levels: word tokens, POS tags, verb lemmas, semantic information from WordNet, etc.
POS tags and lemmas are automatically extracted using GATE. Lexical features relate to both the nominals and the contextual information (the two words before the first nominal, the two words after the second nominal, and the word list in between).
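The contextual window just described can be sketched as simple token slicing; a hedged Python illustration (the real pipeline used GATE components, and the function name is ours):

```python
def contextual_features(tokens, i1, i2):
    """Context-window features: two tokens before the first nominal,
    two after the second, and the token list in between.
    Indices i1, i2 mark the nominal positions (i1 < i2)."""
    return {
        "before": tokens[max(0, i1 - 2):i1],
        "between": tokens[i1 + 1:i2],
        "after": tokens[i2 + 1:i2 + 3],
    }

tokens = ["Put", "tea", "in", "a", "heat-resistant", "jug"]
feats = contextual_features(tokens, 1, 5)
# {'before': ['Put'], 'between': ['in', 'a', 'heat-resistant'], 'after': []}
```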
At the semantic level, we used synset number and the lexical file number for each nominal. For the Part-Whole relation, we use an additional feature indicating the meronymy relation between the two nominals.
The set of WordNet lexical files can help to determine whether the nominals satisfy the restrictions for each relation. For example, in the Theme-Tool relation, the theme should be an object, an event, a state of being, an agent, or a substance. Otherwise, it is possible to affirm that the relation is false.
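The Theme-Tool restriction just described can be written as another abstaining filter; a minimal sketch (category names are illustrative stand-ins for WordNet lexical file labels):

```python
# Categories a THEME may belong to, per the restriction above.
ALLOWED_THEME_CATEGORIES = {"OBJECT", "EVENT", "STATE", "AGENT", "SUBSTANCE"}

def theme_tool_rule(theme_category):
    """False when the theme violates the restriction; None = abstain."""
    if theme_category not in ALLOWED_THEME_CATEGORIES:
        return False
    return None

print(theme_tool_rule("TIME"))   # False
print(theme_tool_rule("EVENT"))  # None
```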
We trained support vector machines with the training data, and we used the following features: at the lexical level, we used the nominals and their heads.
Parallel to the work on training the machine learning model, a set of linguistic and knowledge-based constraint rules was manually developed.
The constraint rules do not aim at total coverage of the whole dataset, but at improving the results of the learning algorithm in cases where it is obvious whether the relation is true or false according to simple semantic and knowledge constraints.
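The hybrid combination this describes — a rule overrides the classifier only when it fires, and otherwise the SVM prediction stands — can be sketched in a few lines (an illustration of the scheme, with our own function names):

```python
def hybrid_decision(svm_prediction, rule_decision):
    """Combine the SVM output with a constraint rule: when the rule
    fires (returns True/False) it overrides the classifier; when it
    abstains (None), the SVM prediction stands."""
    return rule_decision if rule_decision is not None else svm_prediction

print(hybrid_decision(True, False))  # False  (rule overrides)
print(hybrid_decision(True, None))   # True   (rule abstains)
```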
We counted on three basic sources of information, considered as basic criteria:
- WordNet information concerning the synsets and lexical file numbers of each nominal in the WordNet structure.
- POS information, mainly grammatical categories and lemmas, provided by the basic processing module.
- Lexical information regarding the word tokens occurring in the sentences, also provided by the basic processing module.
It is important to highlight that we did not use a syntactic analyzer or a chunker. Thus, the syntactic information included in the rules consists basically of simple sequences of POS tags.
Rules developed can be divided into four basic classes: 1) semantic rules dealing with the most abstract level, which depend mainly on the lexical file numbers; 2) lexico-semantic rules, where special lemmas and tokens of the contextual information are taken into account; 3) morphosyntactic rules, where special patterns consisting of sequences of POS tags are considered; 4) complex rules combining two or more of the above-mentioned criteria or addressing special grammatical phenomena such as the passive voice.
The Product-Producer relationship is frequently represented in English through transitive verbs indicating actions like {make, produce, manufacture}, {develop, compose, prepare}, or {create, procreate}. A complex rule was developed (combining lexical, semantic, and syntactic criteria through a sequence of POS tags), using morpho-syntactic and lexical information together with semantic information regarding the synsets of these verbs, and simple semantic constraints indicating that PRODUCT cannot be TIME, STATE, or ACTIVITY.
Best results were obtained for the PRODUCT-PRODUCER relation, where the improvement reached almost 15%, raising the overall accuracy from 60.8% to 75%. This is due to the relatively high coverage of the rules (48%) and their high precision. In CAUSE-EFFECT, rules improved the results by approximately 4.5%, raising the overall accuracy from 56.2% to 60.75%. Again, the rules showed high precision. For INSTRUMENT-AGENCY, the improvement was trivial (only 1.3%), and thus we believe that more rules should be implemented. Results for the ORIGIN-ENTITY relation were improved by 5%, reaching an overall accuracy of 70.5% compared with the 65.4% achieved by the machine learning algorithm alone. For the THEME-TOOL relation, the enhancement obtained is almost 6%, improving the total accuracy from 54.2% to 60%. It is also important to point out that the precision of the rules in this case was optimal (100%).
Finally, in the PART-WHOLE relation, rules covered 36% of the test data. The improvement obtained reaches 6%, enhancing the total accuracy to 76.38% from the 70.8% obtained by the machine learning module (for this relation, our system was the first in SEMEVAL Task 4).
Results obtained demonstrate that applying a hybrid approach to a supervised system achieves good results in a semantic classification task between nominals. The improvement obtained varies from 1.2% to 15% over the total accuracy scores obtained by the support vector machine algorithm, except for the relation CONTENT-CONTAINER.
We want to introduce further training and enhancement. Improving results across the whole dataset requires wider use of semantic information, such as the hypernyms of each synset or information regarding entity features. Also, introducing more syntactic information through syntactic chunking could significantly affect the results. In the rule-based module, we consider increasing the set of rules for broader coverage.