An Element-wise Visual-enhanced BiLSTM-CRF Model for Location Name Recognition


Takashi Inui

splu2020 presentation slide


An Element-wise Visual-Enhanced BiLSTM-CRF Model for Location Name Recognition
Takuya Komada, Takashi Inui
Department of Computer Science, University of Tsukuba
komada@mibel.cs.tsukubai.ac.jp

Background
• Location names
  • Location information is an essential component of some NLP applications,
    e.g. location name disambiguation and mapping location names to geographic locations.
• Named Entity Recognition (NER)
  • Detect entities and classify each one into a predefined type, e.g. LOC, PER, ORG.
  • For example: "Leading to [Tokyo LOC]."
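The bracket notation above can be turned into per-token labels. The sketch below is illustrative only: the BIO tagging scheme, the pre-split tokens, and the helper name `to_bio` are assumptions made for this example and are not taken from the slides.

```python
def to_bio(tokens, spans):
    """Convert entity spans to BIO tags.

    spans: list of (start, end, label) using token indices, end exclusive.
    """
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        tags[start] = f"B-{label}"       # first token of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"       # remaining tokens of the entity
    return tags


tokens = ["Leading", "to", "Tokyo", "."]
spans = [(2, 3, "LOC")]  # "Tokyo" is annotated as LOC
tags = to_bio(tokens, spans)
print(list(zip(tokens, tags)))
```

A tagger such as a BiLSTM-CRF then predicts one such tag per input unit, and entities are read back from the B-/I- runs.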

Background
• Multimodal NER
  • Deep learning models that use visual information.
  • Extract named entities from social media posts with attached images (Twitter, Snapchat, etc.).
• Related work
  • Moon, Lu, and Zhang proposed neural NER models that use images attached to the document.
  • These models use only the image attached to the document.

Background
• Effectiveness of images
  • Visual information can explain word meanings.
    • Skyscrapers in Fig. 1
    • Townscapes surrounded by mountains and rivers in Fig. 2
  • Shenzhen and Dubai share the same NE type, and their images are similar.

Research Objective
• Image data corresponding to each word should provide rich information about word meanings.
• Propose a method that uses images more effectively.
  • Image data are obtained for each word.
  • Introduce a gate mechanism that controls the extent to which the visual feature is used.

Proposed Model
• (Baseline) Character-based BiLSTM-CRF Model

Proposed Model
• Obtain Image Embeddings

Proposed Model
• Image Retrieval

Proposed Model
• Image Retrieval

Proposed Model
• Obtaining Visual Embeddings

Proposed Model
• Visual (Simple) Model

Proposed Model
• Visual (Gate) Model

Experiments
• Dataset
  • Extended Named Entity corpus
• Images
  • Retrieved from Google Images with the photo option.
  • Queries are all nouns in the documents.
  • Obtain the top 15 images.
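One way to picture the image side of this setup is sketched below. Everything concrete here is an assumption for illustration: the POS tag name `NOUN`, the helper names `noun_queries` and `mean_pool`, the tiny 3-dimensional feature vectors, and the choice of averaging image features into one vector per word. The slides state only that the queries are nouns and that the top 15 images are used.

```python
TOP_K = 15  # number of images kept (the value given on the slide)


def noun_queries(tagged_tokens):
    """Return unique nouns, in order of first appearance, as image search queries."""
    seen = set()
    queries = []
    for word, pos in tagged_tokens:
        if pos == "NOUN" and word not in seen:
            seen.add(word)
            queries.append(word)
    return queries


def mean_pool(image_features, top_k=TOP_K):
    """Average the feature vectors of the first top_k images into one vector."""
    kept = image_features[:top_k]
    if not kept:
        raise ValueError("no image features to pool")
    dim = len(kept[0])
    return [sum(vec[d] for vec in kept) / len(kept) for d in range(dim)]


# Toy tagged sentence; proper nouns are simply labeled NOUN here.
tagged = [("We", "PRON"), ("visited", "VERB"), ("Avignon", "NOUN"),
          ("and", "CCONJ"), ("the", "DET"), ("bridge", "NOUN")]
queries = noun_queries(tagged)

# Stand-in feature vectors; a real system would take these from an image encoder.
fake_features = [[1.0, 0.0, 2.0], [3.0, 2.0, 2.0]]
word_vector = mean_pool(fake_features)
```

Mean pooling is only one possible way to combine several images per word; it is used here because it needs no extra parameters.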

Experiments
• Settings
  • Baseline: uses no visual features.
  • Visual (Simple): uses the element-wise visual features.
  • Visual (Gate): uses the element-wise visual features with the gate mechanism.
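The difference between the two visual settings can be sketched as follows. The slide text does not give the equations, so the formulation here is an assumption: the Simple variant concatenates the visual vector to the text vector, while the Gate variant first scales each dimension of the visual vector by a sigmoid gate computed from both vectors. The weight names `W_t`, `W_v`, `b` and all numeric values are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def visual_simple(text_vec, visual_vec):
    """Assumed 'Simple' variant: concatenate text and visual features."""
    return text_vec + visual_vec


def visual_gate(text_vec, visual_vec, W_t, W_v, b):
    """Assumed 'Gate' variant: g = sigmoid(W_t t + W_v v + b), output [t; g * v]."""
    pre = [a + c + bias
           for a, c, bias in zip(matvec(W_t, text_vec), matvec(W_v, visual_vec), b)]
    gate = [sigmoid(p) for p in pre]
    gated = [g * v for g, v in zip(gate, visual_vec)]
    return text_vec + gated, gate


text_vec = [0.5, -1.0]
visual_vec = [2.0, 4.0]
W_t = [[0.0, 0.0], [0.0, 0.0]]  # zero weights so the gate depends only on b
W_v = [[0.0, 0.0], [0.0, 0.0]]
b = [0.0, -50.0]                # second gate unit is almost closed

simple_out = visual_simple(text_vec, visual_vec)
gated_out, gate = visual_gate(text_vec, visual_vec, W_t, W_v, b)
```

With these toy weights the first visual dimension passes at half strength and the second is suppressed almost entirely, which is the kind of control the gate is meant to provide when a retrieved image is uninformative.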

Experimental Results
• Results
  • Both models using element-wise visual features outperformed the baseline model.

Experimental Results
• Examples
  • (ex.1-E) Signed by 117 countries and two regions at the Final Protocol and Convention Signing Conference in [Jamaica Country] in December 1982.
  • (ex.2-E) Finally, tomorrow is the last day of our stay in France, except for the day we leave. We're going to [Avignon City].

Experimental Results
• Unseen words
  • Precision improved the most (Seen: +5.08, Unseen: +7.07).
  • Visual features improve true-negative performance, i.e. the model produces fewer false positives.
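For reference, a seen/unseen breakdown of this kind can be computed along the following lines. This is a sketch under assumptions: entities are compared as exact (start, end, label, surface) matches, an entity counts as "seen" when its surface form occurs in the training vocabulary, and the function names and toy data are invented for the example. It does not reproduce the numbers reported above.

```python
def prf(gold, pred):
    """Precision, recall and F1 over two sets of entities."""
    tp = len(gold & pred)
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def split_seen(entities, train_vocab):
    """Split entities by whether their surface form appeared in training data."""
    seen = {e for e in entities if e[3] in train_vocab}
    return seen, entities - seen


train_vocab = {"Tokyo", "France"}
gold = {(0, 1, "LOC", "Tokyo"), (5, 6, "LOC", "Avignon")}
# The prediction contains one false positive ("Final").
pred = {(0, 1, "LOC", "Tokyo"), (5, 6, "LOC", "Avignon"), (8, 9, "LOC", "Final")}

gold_seen, gold_unseen = split_seen(gold, train_vocab)
pred_seen, pred_unseen = split_seen(pred, train_vocab)

seen_scores = prf(gold_seen, pred_seen)
unseen_scores = prf(gold_unseen, pred_unseen)
```

In the toy data the false positive falls in the unseen group and lowers only its precision, which shows why removing false positives moves precision more than recall.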

Conclusions
• Proposed an element-wise visual-enhanced NER model
  • Element-wise visual features
  • Image retrieval
  • Gate mechanism
• Achieved a higher F1 score than the baseline model
• Future research
  • Investigate effectiveness on other NE classes.
  • Improve the model through more elaborate query investigations motivated by the error analysis.
  • Attempt queries with nouns and adjectives/verbs.
