SlideShare a Scribd company logo
1 of 23
Download to read offline
1
Kostadin Cholakov, Judith Eckle-Kohler and Iryna Gurevych
Automated Verb Sense Labelling
Based on Linked Lexical Resources
2
Outline
Evaluation
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Take Home Messages
Automated Verb Sense Labelling in a Nutshell
3April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Motivation
Motivation
 Sense annotated corpora are important resources in NLP
 usually created manually which is time consuming and expensive
 verbs have more senses and thus, annotating verb senses is more
difficult
Solution
 Using a large-scale linked lexical resource for creating data annotated
with verb senses automatically
UBY
4
Linking Lexical Resources at the Word Sense
Level – example: UBY
Web 2.0
IMSLex-Subcat
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
UBY
5
Linking Lexical Resources at the Word Sense
Level – example: UBY
Web 2.0
IMSLex-Subcat
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
UBY
Open Source Java API: http://code.google.com/p/uby/
6April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Automated Verb Sense Labelling: Approach
UBY
Corpus
Uby: Verb Sense Patterns derived from lexical information
Corpus: Verb Sense Patterns derived from verb instances
Similarity Metric
7
WN ask%2:32:01 (make a request or demand for something to somebody)
is linked to FN Id 639 (request to do or give something):
As twenty are required it might pay to ask your supplier for a ` bulk discount ".
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Step 1: Creation of sense patterns from
enriched senses
UBY
Uby: [ask%2:32:0] be PP VV to ask person for a JJ act
8April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Step 1: Creation of sense patterns from
enriched senses
9April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Step 1: Creation of sense patterns from
enriched senses
sense enrichment predicate argument structure information
10April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Step 2: Automated Labelling based on Pattern
Similarity
WN ask%2:32:01 is linked to FN Id 639:
As twenty are required it might pay to ask your supplier for a ` bulk discount ".
UBY
he would n't be pleased if a rumdum like me were to ask
his daughter for a date
Similarity score: 0.217 > threshold
Uby: [ask%2:32:01] be PP VV to ask person for a JJ act
Corpus: if PP be to ask person for a time
11April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Step 2: Automated Labelling based on Pattern
Similarity
 Using a similarity metric to compare patterns derived from UBY and
patterns derived from verb instances found in corpora
 Considers the common bi-, tri-, and four-grams of two patterns:
 Takes word order into account!
w >= 1 is the window around the verb
Gn(pi) is the set of ngrams occurring in pattern pi
12
Outline
Evaluation
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Take Home Messages
Automated Verb Sense Labelling in a Nutshell
13April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Intrinsic Evaluation
 Evaluation for occurrences of Senseval-3 verbs in SemCor (152 verbs)
 Ca. 33.000 sense patterns generated from WN-FN-WKT for these verbs
 various similarity thresholds t
14April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Extrinsic Evaluation – Experimental Setup
Comparison of two supervised classifiers for verb sense
disambiguation:
1. Trained on an automatically labelled corpus (ALC):
 Verb senses for test verbs given in MASC and Senseval-3 are
labelled in a huge Web Corpus with similarity threshold t=0.1
2. Trained on SemCor 3.0
Test data:
1. MASC corpus: 16 verbs annotated with WordNet 3.0 senses, 11 997
test instances
2. Senseval-3 dataset for all-words WSD: 152 verbs annotated with
WordNet 3.0 senses, 442 test instances
15April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Training Sets
0 100000 200000 300000 400000
Training Data ALC
SemCor
 SemCor 3.0
 Ca. 22.000 train instances of 16
MASC and 152 Senseval-3 verbs
 Automatically labelled corpus (ALC)
 Ca. 350.000 train instances of 16
MASC and 152 Senseval-3 verbs
16April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Classification
Preprocessing: POS tagging, dependency parsing and Named
Entity recognition
 using the TreeTagger and the Stanford Parser and Named
Entity Recognizer form the DKPro Core component collection,
http://dkpro-core-asl.googlecode.com
Features: lexical, syntactic and semantic features
Classification: A separate logistic regression classifier is
trained for each of the test verbs, using WEKA,
http://www.cs.waikato.ac.nz/ml/weka/
17
Performance of classifiers (accuracy)
evaluated on MASC / Senseval-3
SemCor 3.0
 Evaluation on MASC: 50.23
 Evaluation on Senseval-3: 48.64
(45.20 with back-off)
Automatically labelled corpus (ALC)
 Evaluation on MASC: 49.00
 Evaluation on Senseval-3: 47.51
(43.24 with back-off)
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
MFS Baseline for the two test sets
1. MASC: MFS baseline: 41.72
2. Senseval-3: MFS baseline: 25.34
Training Sets
18April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Extrinsic Evaluation – effect of sense
enrichment
 Best results with the combination WordNet-FrameNet-Wiktionary
 WordNet-FrameNet achieves similar accuracy but the coverage is lower
 WordNet-FrameNet-Wiktionary-VerbNet achieves lower accuracy
 Using WordNet only achieved the lowest coverage and accuracy
19
Outline
Evaluation
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Take Home Messages
Automated Verb Sense Labelling in a Nutshell
20
Linked Lexical Resources such as UBY are knowledge bases …
 … that can be used to perform automated verb sense labelling
 the automatically labelled data can successfully be used to train
supervised Machine Learning systems: Distant / Weak Supervision
 This is due to the enriched sense representation for word senses
that are interlinked
 Particularly useful for languages such as German where lexical resources
are available but no sense-labelled data exist.
April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Take Home Messages
21April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Thank You!
Questions?
22April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Training Data Coverage
Coverage of WN senses annotated in MASC in the training data:
 There are 22 WN senses with instances in MASC which are not found in
SemCor
 There are 34 WN senses with instances in MASC which are not found in
the ALC
 The VSD system cannot correctly classify instances of those senses
 The Coverage of the WN senses annotated in the test sets by the training
data constitutes the upper bound of our classifiers:
 ALC: 0.8805 (increasing the size of the ALC does not help)
 SemCor: 0.948
23April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler
Comparison with other systems for verb sense
disambiguation
 State-of-the-art supervised system (Chen and Palmer 2009) on Senseval-
2 data :
 0.648 accuracy, MFS baseline: 0.407
 Not comparable due to different versions of WordNet used
 Best performing Lesk-based system (Miller et al., 2012):
 33.86% accuracy for the MASC verbs
 30.16% accuracy for the Senseval-3 verbs

More Related Content

Viewers also liked

Программные решения HP Network Management Center NMC
Программные решения HP Network Management Center NMCПрограммные решения HP Network Management Center NMC
Программные решения HP Network Management Center NMCAnatoliy Arkhipov
 
Daily option news letter 02 july 2013
Daily option news letter 02 july 2013Daily option news letter 02 july 2013
Daily option news letter 02 july 2013Rakhi Tips Provider
 
งานคอม ใบงานที่ 2 8 แก้แล้ว
งานคอม ใบงานที่ 2 8 แก้แล้วงานคอม ใบงานที่ 2 8 แก้แล้ว
งานคอม ใบงานที่ 2 8 แก้แล้วnoeiinoii
 
Designing for self-serve science
Designing for self-serve scienceDesigning for self-serve science
Designing for self-serve sciencedhalperi
 
Blanca nieves y los siete enanitos tatiana jabo juàrez
Blanca nieves y los siete enanitos  tatiana jabo juàrezBlanca nieves y los siete enanitos  tatiana jabo juàrez
Blanca nieves y los siete enanitos tatiana jabo juàreztatianadamari
 
Texecom συναγερμοί από τη GNEMS
Texecom συναγερμοί από τη GNEMSTexecom συναγερμοί από τη GNEMS
Texecom συναγερμοί από τη GNEMSGeorgios Ath. Kounis
 
ใบงานท9 16มม
ใบงานท9 16มมใบงานท9 16มม
ใบงานท9 16มมnoeiinoii
 
Frenemies to Friends - Bridging the Gap Between Marketing & Sales
Frenemies to Friends - Bridging the Gap Between Marketing & SalesFrenemies to Friends - Bridging the Gap Between Marketing & Sales
Frenemies to Friends - Bridging the Gap Between Marketing & SalesCourtney Wilson
 

Viewers also liked (14)

Программные решения HP Network Management Center NMC
Программные решения HP Network Management Center NMCПрограммные решения HP Network Management Center NMC
Программные решения HP Network Management Center NMC
 
Daily option news letter 02 july 2013
Daily option news letter 02 july 2013Daily option news letter 02 july 2013
Daily option news letter 02 july 2013
 
Bio ent47
Bio ent47Bio ent47
Bio ent47
 
05 e
05 e05 e
05 e
 
Biostatflow
BiostatflowBiostatflow
Biostatflow
 
งานคอม ใบงานที่ 2 8 แก้แล้ว
งานคอม ใบงานที่ 2 8 แก้แล้วงานคอม ใบงานที่ 2 8 แก้แล้ว
งานคอม ใบงานที่ 2 8 แก้แล้ว
 
Designing for self-serve science
Designing for self-serve scienceDesigning for self-serve science
Designing for self-serve science
 
Blanca nieves y los siete enanitos tatiana jabo juàrez
Blanca nieves y los siete enanitos  tatiana jabo juàrezBlanca nieves y los siete enanitos  tatiana jabo juàrez
Blanca nieves y los siete enanitos tatiana jabo juàrez
 
Texecom Style Guide 2
Texecom Style Guide 2Texecom Style Guide 2
Texecom Style Guide 2
 
Texecom συναγερμοί από τη GNEMS
Texecom συναγερμοί από τη GNEMSTexecom συναγερμοί από τη GNEMS
Texecom συναγερμοί από τη GNEMS
 
style-guide
style-guidestyle-guide
style-guide
 
ใบงานท9 16มม
ใบงานท9 16มมใบงานท9 16มม
ใบงานท9 16มม
 
Rocks assignment 5
Rocks assignment 5Rocks assignment 5
Rocks assignment 5
 
Frenemies to Friends - Bridging the Gap Between Marketing & Sales
Frenemies to Friends - Bridging the Gap Between Marketing & SalesFrenemies to Friends - Bridging the Gap Between Marketing & Sales
Frenemies to Friends - Bridging the Gap Between Marketing & Sales
 

Similar to Automated Verb Sense Labelling Based on Linked Lexical Resources. Presentation by Judith Eckle-Kohler at EACL 2014 in Gothenburg. Joint work with Kostadin Cholakov and Iryna Gurevych.

Natural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual DataNatural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual Datagpano
 
Presentation of eQNet to LRE subcommittee october 11, 2011
Presentation of eQNet to LRE subcommittee october 11, 2011Presentation of eQNet to LRE subcommittee october 11, 2011
Presentation of eQNet to LRE subcommittee october 11, 2011Elena Shulman
 
project sentiment analysis
project sentiment analysisproject sentiment analysis
project sentiment analysissneha penmetsa
 
Update to the ELIXIR Board Nov 2014
Update to the ELIXIR Board Nov 2014Update to the ELIXIR Board Nov 2014
Update to the ELIXIR Board Nov 2014ELIXIR UK
 
Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014Maria Eskevich
 
A Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering SystemsA Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering Systemsijaia
 
Mahesh Joshi
Mahesh JoshiMahesh Joshi
Mahesh Joshibutest
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Educationarammann
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Educationiloughlab
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Educationstemrpm
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Educationstemprm
 
Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...
Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...
Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...HCI Lab
 
Deployment of remote experiments
Deployment of remote experiments  Deployment of remote experiments
Deployment of remote experiments guestcb3e966
 
Using the nominal group technique with clickers to research student experienc...
Using the nominal group technique with clickers to research student experienc...Using the nominal group technique with clickers to research student experienc...
Using the nominal group technique with clickers to research student experienc...Tünde Varga-Atkins
 
NLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaNLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaSpark Summit
 
NLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextNLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextHortonworks
 
NLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaNLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaSpark Summit
 
Making the introductory science lab accessible online apr 2012
Making the introductory science lab accessible online apr 2012Making the introductory science lab accessible online apr 2012
Making the introductory science lab accessible online apr 2012gregkp
 

Similar to Automated Verb Sense Labelling Based on Linked Lexical Resources. Presentation by Judith Eckle-Kohler at EACL 2014 in Gothenburg. Joint work with Kostadin Cholakov and Iryna Gurevych. (20)

Natural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual DataNatural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual Data
 
Presentation of eQNet to LRE subcommittee october 11, 2011
Presentation of eQNet to LRE subcommittee october 11, 2011Presentation of eQNet to LRE subcommittee october 11, 2011
Presentation of eQNet to LRE subcommittee october 11, 2011
 
project sentiment analysis
project sentiment analysisproject sentiment analysis
project sentiment analysis
 
Brownie v1.0
Brownie v1.0Brownie v1.0
Brownie v1.0
 
Resume_bharadwaj(3515)
Resume_bharadwaj(3515)Resume_bharadwaj(3515)
Resume_bharadwaj(3515)
 
Update to the ELIXIR Board Nov 2014
Update to the ELIXIR Board Nov 2014Update to the ELIXIR Board Nov 2014
Update to the ELIXIR Board Nov 2014
 
Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014Search and Hyperlinking Overview @MediaEval2014
Search and Hyperlinking Overview @MediaEval2014
 
A Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering SystemsA Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering Systems
 
Mahesh Joshi
Mahesh JoshiMahesh Joshi
Mahesh Joshi
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Education
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Education
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Education
 
Constructivist Laboratory Education
Constructivist Laboratory EducationConstructivist Laboratory Education
Constructivist Laboratory Education
 
Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...
Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...
Augmenting Speech-Language Rehabilitation with Brain Computer Interfaces: An ...
 
Deployment of remote experiments
Deployment of remote experiments  Deployment of remote experiments
Deployment of remote experiments
 
Using the nominal group technique with clickers to research student experienc...
Using the nominal group technique with clickers to research student experienc...Using the nominal group technique with clickers to research student experienc...
Using the nominal group technique with clickers to research student experienc...
 
NLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaNLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey Stella
 
NLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextNLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-Text
 
NLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaNLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey Stella
 
Making the introductory science lab accessible online apr 2012
Making the introductory science lab accessible online apr 2012Making the introductory science lab accessible online apr 2012
Making the introductory science lab accessible online apr 2012
 

Recently uploaded

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
insect anatomy and insect body wall and their physiology
insect anatomy and insect body wall and their  physiologyinsect anatomy and insect body wall and their  physiology
insect anatomy and insect body wall and their physiologyDrAnita Sharma
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2John Carlo Rollon
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Cytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptxCytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptxVarshiniMK
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555kikilily0909
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsCharlene Llagas
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |aasikanpl
 
TOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptxTOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptxdharshini369nike
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Temporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of MasticationTemporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of Masticationvidulajaib
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 

Recently uploaded (20)

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
insect anatomy and insect body wall and their physiology
insect anatomy and insect body wall and their  physiologyinsect anatomy and insect body wall and their  physiology
insect anatomy and insect body wall and their physiology
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Cytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptxCytokinin, mechanism and its application.pptx
Cytokinin, mechanism and its application.pptx
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of Traits
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
TOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptxTOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptx
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Temporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of MasticationTemporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of Mastication
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 

Automated Verb Sense Labelling Based on Linked Lexical Resources. Presentation by Judith Eckle-Kohler at EACL 2014 in Gothenburg. Joint work with Kostadin Cholakov and Iryna Gurevych.

  • 1. 1 Kostadin Cholakov, Judith Eckle-Kohler and Iryna Gurevych Automated Verb Sense Labelling Based on Linked Lexical Resources
  • 2. 2 Outline Evaluation April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Take Home Messages Automated Verb Sense Labelling in a Nutshell
  • 3. 3April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Motivation Motivation  Sense annotated corpora are important resources in NLP  usually created manually which is time consuming and expensive  verbs have more senses and thus, annotating verb senses is more difficult Solution  Using a large-scale linked lexical resource for creating data annotated with verb senses automatically UBY
  • 4. 4 Linking Lexical Resources at the Word Sense Level – example: UBY Web 2.0 IMSLex-Subcat April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler UBY
  • 5. 5 Linking Lexical Resources at the Word Sense Level – example: UBY Web 2.0 IMSLex-Subcat April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler UBY Open Source Java API: http://code.google.com/p/uby/
  • 6. 6April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Automated Verb Sense Labelling: Approach UBY Corpus Uby: Verb Sense Patterns derived from lexical information Corpus: Verb Sense Patterns derived from verb instances Similarity Metric
  • 7. 7 WN ask%2:32:01 (make a request or demand for something to somebody) is linked to FN Id 639 (request to do or give something): As twenty are required it might pay to ask your supplier for a ` bulk discount ". April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Step 1: Creation of sense patterns from enriched senses UBY Uby: [ask%2:32:0] be PP VV to ask person for a JJ act
  • 8. 8April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Step 1: Creation of sense patterns from enriched senses
  • 9. 9April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Step 1: Creation of sense patterns from enriched senses sense enrichment predicate argument structure information
  • 10. 10April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Step 2: Automated Labelling based on Pattern Similarity WN ask%2:32:01 is linked to FN Id 639: As twenty are required it might pay to ask your supplier for a ` bulk discount ". UBY he would n't be pleased if a rumdum like me were to ask his daughter for a date Similarity score: 0.217 > threshold Uby: [ask%2:32:01] be PP VV to ask person for a JJ act Corpus: if PP be to ask person for a time
  • 11. 11April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Step 2: Automated Labelling based on Pattern Similarity  Using a similarity metric to compare patterns derived from UBY and patterns derived from verb instances found in corpora  Considers the common bi-, tri-, and four-grams of two patterns:  Takes word order into account! w >= 1 is the window around the verb Gn(pi) is the set of ngrams occurring in pattern pi
  • 12. 12 Outline Evaluation April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Take Home Messages Automated Verb Sense Labelling in a Nutshell
  • 13. 13April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Intrinsic Evaluation  Evaluation for occurrences of Senseval-3 verbs in SemCor (152 verbs)  Ca. 33.000 sense patterns generated from WN-FN-WKT for these verbs  various similarity thresholds t
  • 14. 14April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Extrinsic Evaluation – Experimental Setup Comparison of two supervised classifiers for verb sense disambiguation: 1. Trained on an automatically labelled corpus (ALC):  Verb senses for test verbs given in MASC and Senseval-3 are labelled in a huge Web Corpus with similarity threshold t=0.1 2. Trained on SemCor 3.0 Test data: 1. MASC corpus: 16 verbs annotated with WordNet 3.0 senses, 11 997 test instances 2. Senseval-3 dataset for all-words WSD: 152 verbs annotated with WordNet 3.0 senses, 442 test instances
  • 15. 15April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Training Sets 0 100000 200000 300000 400000 Training Data ALC SemCor  SemCor 3.0  Ca. 22.000 train instances of 16 MASC and 152 Senseval-3 verbs  Automatically labelled corpus (ALC)  Ca. 350.000 train instances of 16 MASC and 152 Senseval-3 verbs
  • 16. 16April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Classification Preprocessing: POS tagging, dependency parsing and Named Entity recognition  using the TreeTagger and the Stanford Parser and Named Entity Recognizer form the DKPro Core component collection, http://dkpro-core-asl.googlecode.com Features: lexical, syntactic and semantic features Classification: A separate logistic regression classifier is trained for each of the test verbs, using WEKA, http://www.cs.waikato.ac.nz/ml/weka/
  • 17. 17 Performance of classifiers (accuracy) evaluated on MASC / Senseval-3 SemCor 3.0  Evaluation on MASC: 50.23  Evaluation on Senseval-3: 48.64 (45.20 with back-off) Automatically labelled corpus (ALC)  Evaluation on MASC: 49.00  Evaluation on Senseval-3: 47.51 (43.24 with back-off) April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler MFS Baseline for the two test sets 1. MASC: MFS baseline: 41.72 2. Senseval-3: MFS baseline: 25.34 Training Sets
  • 18. 18April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Extrinsic Evaluation – effect of sense enrichment  Best results with the combination WordNet-FrameNet-Wiktionary  WordNet-FrameNet achieves similar accuracy but the coverage is lower  WordNet-FrameNet-Wiktionary-VerbNet achieves lower accuracy  Using WordNet only achieved the lowest coverage and accuracy
  • 19. 19 Outline Evaluation April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Take Home Messages Automated Verb Sense Labelling in a Nutshell
  • 20. 20 Linked Lexical Resources such as UBY are knowledge bases …  … that can be used to perform automated verb sense labelling  the automatically labelled data can successfully be used to train supervised Machine Learning systems: Distant / Weak Supervision  This is due to the enriched sense representation for word senses that are interlinked  Particularly useful for languages such as German where lexical resources are available but no sense-labelled data exist. April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Take Home Messages
  • 21. 21April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Thank You! Questions?
  • 22. 22April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Training Data Coverage Coverage of WN senses annotated in MASC in the training data:  There are 22 WN senses with instances in MASC which are not found in SemCor  There are 34 WN senses with instances in MASC which are not found in the ALC  The VSD system cannot correctly classify instances of those senses  The Coverage of the WN senses annotated in the test sets by the training data constitutes the upper bound of our classifiers:  ALC: 0.8805 (increasing the size of the ALC does not help)  SemCor: 0.948
  • 23. 23April 28, 2014 | Computer Science Department | UKP Lab Prof. Iryna Gurevych | Dr. Judith Eckle-Kohler Comparison with other systems for verb sense disambiguation  State-of-the-art supervised system (Chen and Palmer 2009) on Senseval- 2 data :  0.648 accuracy, MFS baseline: 0.407  Not comparable due to different versions of WordNet used  Best performing Lesk-based system (Miller et al., 2012):  33.86% accuracy for the MASC verbs  30.16% accuracy for the Senseval-3 verbs