2. Introduction
• This paper explores the idea of generating contextual definitions
for words using a deep-learning model. It does this by accepting a
word and a context for that word and then autoregressively
generating a definition to match the specific context.
Overview
• Created a new dataset with definition and context pairs.
• Trained a GPT-2 model on the dataset
• Evaluated the model with human raters
3. Motivation for work
• Approximately 98% of words must be within a reader’s vocabulary for
optimal reading comprehension to occur.
• Textbooks often attempt to make up for potential vocabulary gaps by
defining key terms.
• Problems:
• Reader is required to stop reading and look up the definition
• Limited number of terms defined
• Term may have multiple definitions
4. Motivation for work (cont.)
• Modern software can make the process easier.
• Can use search engine
• Newer tools allow reader to highlight word and have the definition appear in
a pop-up.
• Problems:
• Definitions may be vague and not adequately fit the context.
• Word may have a long list of definitions, and the reader must pick the most
appropriate one.
5. Data Collection
• All data was required to pair each definition with a labeled context.
• With this in mind, we collected data from the following sources:
• Lexico
• Wikipedia
• Wiktionary
• Wordnet
7. Definition Modification
• Some definitions contained little information.
• We attempt to expand these definitions by using regular expressions, part-of-speech
tags, and word frequency to find the key reference word.
• We then choose the most fitting definition by comparing word vectors for each
definition of the reference word (e.g., “country”) against the context and
selecting the most similar one by cosine similarity.
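The selection step above can be sketched as follows. The plain-list vectors and the `pick_definition` helper are illustrative stand-ins; the slides do not name the embedding library the authors actually used.

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two dense vectors given as plain lists.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def pick_definition(context_vec, definition_vecs):
    # Return the index of the candidate definition whose vector is most
    # similar to the context vector.
    sims = [cosine_similarity(context_vec, d) for d in definition_vecs]
    return max(range(len(sims)), key=sims.__getitem__)
```

With real embeddings, `context_vec` would typically be an averaged (or otherwise pooled) vector over the context's words, and each entry in `definition_vecs` a pooled vector over one candidate definition of the reference word.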
8. Model
• GPT-2 is an autoregressive model that uses the decoding blocks of the
transformer architecture.
1. Animation sourced from The Illustrated GPT-2 written by Jay Alammar
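The autoregressive decoding described above can be illustrated with a toy sketch: at each step the model predicts the next token from everything generated so far, and the chosen token is appended to the input for the following step. `TOY_MODEL` and `toy_next_token` are hypothetical stand-ins for a real GPT-2 forward pass.

```python
# Hypothetical bigram "model": maps the current token to the next token.
TOY_MODEL = {
    "<DEFINITION>": "a", "a": "celestial", "celestial": "body", "body": "<END>",
}

def toy_next_token(tokens):
    # Stand-in for a model forward pass: predict the next token
    # from the sequence generated so far.
    return TOY_MODEL.get(tokens[-1], "<END>")

def generate(prompt_tokens, max_len=10):
    # Greedy autoregressive loop: append each predicted token and
    # feed the extended sequence back in, until <END> or max_len.
    tokens = list(prompt_tokens)
    for _ in range(max_len):
        nxt = toy_next_token(tokens)
        tokens.append(nxt)
        if nxt == "<END>":
            break
    return tokens
```

Here `generate(["<DEFINITION>"])` walks the toy chain to produce `["<DEFINITION>", "a", "celestial", "body", "<END>"]`; a real GPT-2 replaces the lookup with a learned distribution over the vocabulary.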
9. Model (cont.)
• Trained the model for 1 epoch
• Used GPT-2 Large: the 774M-parameter model.
• Two special tokens: <CONTEXT> and <DEFINITION>
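A minimal sketch of how training examples might be laid out with these special tokens; the exact ordering and separators are an assumption, as the slides only name the two tokens.

```python
CONTEXT_TOKEN = "<CONTEXT>"
DEFINITION_TOKEN = "<DEFINITION>"

def format_example(word, context, definition):
    # One plausible layout for a fine-tuning example: word and context
    # precede the definition, so at inference time the model continues
    # after <DEFINITION> with a context-appropriate definition.
    return f"{word} {CONTEXT_TOKEN} {context} {DEFINITION_TOKEN} {definition}"
```

With Hugging Face `transformers`, such tokens would typically be registered via `tokenizer.add_special_tokens(...)` followed by `model.resize_token_embeddings(len(tokenizer))` so the embedding matrix covers the enlarged vocabulary.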
10. Human Evaluation
• Posted the survey on CloudResearch, which sources high-quality
participants from Mechanical Turk.
• Allowed participants to choose what topic they wanted to evaluate.
The topics available were from the following subjects:
• American Government
• Anatomy and Physiology
• Astronomy
• Psychology
• Three different surveys for the following context types:
1. Model-generated Short-context: Term used in a sentence
2. Model-generated Long-context: Term used in a sentence along with both
the prior and following sentence.
3. Human-generated: Definitions from the training dataset.
• Raters evaluated 50 questions each.
12. Results
• Short-context performed significantly better than long-context in terms of accuracy (𝑝 = 0.045). We
speculate this is because the training data contained far more short contexts than
long ones.
• Real definitions performed significantly better than both model-generated context types (𝑝 < 0.001).
• There were no significant differences in fluency.
• The effect of topic was trending toward significance but did not reach it.
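The slides do not state which significance test produced these 𝑝-values; as one plausible choice, a Welch's t statistic for comparing two independent groups of ratings can be sketched as follows (the sample ratings below are invented for illustration).

```python
import math
from statistics import mean, variance

def welch_t(sample_a, sample_b):
    # Welch's t statistic for two independent samples with possibly
    # unequal variances: difference of means over the combined
    # standard error. (Hypothetical choice of test; not confirmed
    # by the slides.)
    va, vb = variance(sample_a), variance(sample_b)
    na, nb = len(sample_a), len(sample_b)
    se = math.sqrt(va / na + vb / nb)
    return (mean(sample_a) - mean(sample_b)) / se
```

A positive statistic indicates the first group's mean rating is higher; the corresponding 𝑝-value would come from the t distribution with Welch–Satterthwaite degrees of freedom (e.g., via `scipy.stats.ttest_ind(..., equal_var=False)`).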