SlideShare a Scribd company logo
1 of 18
Download to read offline
An Element-wise Visual-Enhanced
BiLSTM-CRF Model for Location
Name Recognition
Takuya Komada, Takashi Inui
Department of Computer Science
University of Tsukuba
1
komada@mibel.cs.tsukubai.ac.jp
Background
l Location Name
l Location information is one of the essential
components for some NLP applications.
l E.g. location name disambiguation , mapping of
location names to geographic locations.
l Named Entity Recognition (NER)
l Detect entities and classify each entity in pre-
defined types. e.g., LOC, PER, ORG.
l For example, Leading to [Tokyo LOC].
komada@mibel.cs.tsukubai.ac.jp
2
Background
l Multimodal NER
l Deep learning models using visual information.
l Extract named entities from image attached
social media posts. (Twitter, SnapChat, etc)
l Related works
l Moon, Lu, Zhang Proposed a neural NER model
using images attached to document.
l Only use image attached to the document.
komada@mibel.cs.tsukubai.ac.jp
3
Background
l Effectiveness of Images
l Visual information can explain word meanings.
l Skyscrapers in Fig. 1
l Townscapes surrounded by mountains and rivers in
Fig. 2
l Shenzhen and Dubai have the same NE aspect and
have similarities in their images
komada@mibel.cs.tsukubai.ac.jp
4
Research Objective
l Image data corresponding to each word
would provide rich information of word
meanings.
l Propose a method that utilizes images more
effectively.
l image data are obtained for each word.
l Introduce a Gate mechanism
l Control the extent to which the visual feature.
komada@mibel.cs.tsukubai.ac.jp
5
Proposed Model
l (Baseline) Character-based BiLSTM-CRF Model
komada@mibel.cs.tsukubai.ac.jp
6
Proposed Model
komada@mibel.cs.tsukubai.ac.jp
7
l Obtain Image Embeddings
Proposed Model
komada@mibel.cs.tsukubai.ac.jp
8
l Image Retrieval
Proposed Model
komada@mibel.cs.tsukubai.ac.jp
9
l Image Retrieval
Proposed Model
komada@mibel.cs.tsukubai.ac.jp
10
l Obtaining Visual Embeddings
Proposed Model
komada@mibel.cs.tsukubai.ac.jp
11
l Visual (Simple) Model
Proposed Model
komada@mibel.cs.tsukubai.ac.jp
12
l Visual (Gate) Model
Experiments
l Dataset
l Extended Named Entity corpus
l Images
l Google Images with photo option
l Queries are all nouns in documents.
l Obtain top 15 images
komada@mibel.cs.tsukubai.ac.jp
13
Experiments
l Settings
l Baseline: No use of visual features.
l Visual (Simple): Uses the element-wise visual
features.
l Visual (Gate): Uses element-wise visual features
with the Gate mechanism.
komada@mibel.cs.tsukubai.ac.jp
14
Experimental Results
l Results
l Both models using element-wise visual features
outperformed the baseline model.
komada@mibel.cs.tsukubai.ac.jp
15
Experimental Results
l Examples
l (ex.1-E) Signed by 117 countries and two regions at the
Final Protocol and Convention Signing Conference in
[Jamaica Country] in December 1982.
l (ex.2-E) Finally, tomorrow is the last day of our stay in
France, except for the day we leave. We re going to
[Avignon City].
komada@mibel.cs.tsukubai.ac.jp
16
Experimental Results
l Unseen words
l precision values improved most significantly
(Seen: +5.08, Unseen: +7.07).
l improve the performance of true-negatives.
komada@mibel.cs.tsukubai.ac.jp
17
Conclusions
l Propose an element-wise visual-enhanced NER
model
l Element-wise visual feature
l Image retrieval
l Gate mechanism
l Achieved a higher F1-value performance than the
baseline model
l Future research
l Investigate of effectiveness in other NE classes.
l Improve our model by conducting elaborate query
investigations that are motivated by the error analysis.
l Attempt queries with nouns and adjectives/verbs.
komada@mibel.cs.tsukubai.ac.jp
18

More Related Content

Similar to An Element-wise Visual-enhanced BiLSTM-CRF Model for Location Name Recognition

Aj2418721874
Aj2418721874Aj2418721874
Aj2418721874IJMER
 
Image Search: Then and Now
Image Search: Then and NowImage Search: Then and Now
Image Search: Then and NowSi Krishan
 
Chapter 1- Introduction.ppt
Chapter 1- Introduction.pptChapter 1- Introduction.ppt
Chapter 1- Introduction.pptTigistTilahun1
 
Extraction of Buildings from Satellite Images
Extraction of Buildings from Satellite ImagesExtraction of Buildings from Satellite Images
Extraction of Buildings from Satellite ImagesAkanksha Prasad
 
Opps approch of software development
Opps approch of software developmentOpps approch of software development
Opps approch of software developmentRaja Babu
 
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHIMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHcsandit
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative AttributesVikas Jain
 
Automated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU ArchitectureAutomated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU ArchitectureIRJET Journal
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - Hiroshi Fukui
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2Dr. Aparna Varde
 
Image Object Detection Pipeline
Image Object Detection PipelineImage Object Detection Pipeline
Image Object Detection PipelineAbhinav Dadhich
 
sachin presentation.pptx
sachin presentation.pptxsachin presentation.pptx
sachin presentation.pptxMareeswaranM7
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptxK Manjunath
 
System designing approaches
System designing approachesSystem designing approaches
System designing approachesJaipal Dhobale
 

Similar to An Element-wise Visual-enhanced BiLSTM-CRF Model for Location Name Recognition (20)

Eren_Golge_MS_Thesis_2014
Eren_Golge_MS_Thesis_2014Eren_Golge_MS_Thesis_2014
Eren_Golge_MS_Thesis_2014
 
Aj2418721874
Aj2418721874Aj2418721874
Aj2418721874
 
Image Search: Then and Now
Image Search: Then and NowImage Search: Then and Now
Image Search: Then and Now
 
ei2106-submit-opt-415
ei2106-submit-opt-415ei2106-submit-opt-415
ei2106-submit-opt-415
 
Chapter 1- Introduction.ppt
Chapter 1- Introduction.pptChapter 1- Introduction.ppt
Chapter 1- Introduction.ppt
 
07slide.ppt
07slide.ppt07slide.ppt
07slide.ppt
 
Extraction of Buildings from Satellite Images
Extraction of Buildings from Satellite ImagesExtraction of Buildings from Satellite Images
Extraction of Buildings from Satellite Images
 
Opps approch of software development
Opps approch of software developmentOpps approch of software development
Opps approch of software development
 
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHIMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
 
Apple Machine Learning
Apple Machine LearningApple Machine Learning
Apple Machine Learning
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative Attributes
 
Automated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU ArchitectureAutomated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU Architecture
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
OCL3_10_05.pptx
OCL3_10_05.pptxOCL3_10_05.pptx
OCL3_10_05.pptx
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 2
 
Image Object Detection Pipeline
Image Object Detection PipelineImage Object Detection Pipeline
Image Object Detection Pipeline
 
sachin presentation.pptx
sachin presentation.pptxsachin presentation.pptx
sachin presentation.pptx
 
Ch05lect1 ud
Ch05lect1 udCh05lect1 ud
Ch05lect1 ud
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
System designing approaches
System designing approachesSystem designing approaches
System designing approaches
 

More from Takashi Inui

Twitter 投稿における定性的時間表現の使用時刻分布調査
Twitter 投稿における定性的時間表現の使用時刻分布調査Twitter 投稿における定性的時間表現の使用時刻分布調査
Twitter 投稿における定性的時間表現の使用時刻分布調査Takashi Inui
 
地理的知識グラフを取り込んだニューラル文書ジオロケーションモデル
地理的知識グラフを取り込んだニューラル文書ジオロケーションモデル地理的知識グラフを取り込んだニューラル文書ジオロケーションモデル
地理的知識グラフを取り込んだニューラル文書ジオロケーションモデルTakashi Inui
 
新聞記事中の地名に対する地理的位置推定における有効な素性の調査
新聞記事中の地名に対する地理的位置推定における有効な素性の調査新聞記事中の地名に対する地理的位置推定における有効な素性の調査
新聞記事中の地名に対する地理的位置推定における有効な素性の調査Takashi Inui
 
バースト検知手法を用いた桜の見頃推定
バースト検知手法を用いた桜の見頃推定バースト検知手法を用いた桜の見頃推定
バースト検知手法を用いた桜の見頃推定Takashi Inui
 
ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...
ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...
ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...Takashi Inui
 
ニューラル日本語固有表現認識における格フレームの有効性検証
ニューラル日本語固有表現認識における格フレームの有効性検証ニューラル日本語固有表現認識における格フレームの有効性検証
ニューラル日本語固有表現認識における格フレームの有効性検証Takashi Inui
 

More from Takashi Inui (6)

Twitter 投稿における定性的時間表現の使用時刻分布調査
Twitter 投稿における定性的時間表現の使用時刻分布調査Twitter 投稿における定性的時間表現の使用時刻分布調査
Twitter 投稿における定性的時間表現の使用時刻分布調査
 
地理的知識グラフを取り込んだニューラル文書ジオロケーションモデル
地理的知識グラフを取り込んだニューラル文書ジオロケーションモデル地理的知識グラフを取り込んだニューラル文書ジオロケーションモデル
地理的知識グラフを取り込んだニューラル文書ジオロケーションモデル
 
新聞記事中の地名に対する地理的位置推定における有効な素性の調査
新聞記事中の地名に対する地理的位置推定における有効な素性の調査新聞記事中の地名に対する地理的位置推定における有効な素性の調査
新聞記事中の地名に対する地理的位置推定における有効な素性の調査
 
バースト検知手法を用いた桜の見頃推定
バースト検知手法を用いた桜の見頃推定バースト検知手法を用いた桜の見頃推定
バースト検知手法を用いた桜の見頃推定
 
ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...
ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...
ILP-based Opinion Sentence Extraction from User Reviews for Question DB Const...
 
ニューラル日本語固有表現認識における格フレームの有効性検証
ニューラル日本語固有表現認識における格フレームの有効性検証ニューラル日本語固有表現認識における格フレームの有効性検証
ニューラル日本語固有表現認識における格フレームの有効性検証
 

Recently uploaded

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 

Recently uploaded (20)

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 

An Element-wise Visual-enhanced BiLSTM-CRF Model for Location Name Recognition

  • 1. An Element-wise Visual-Enhanced BiLSTM-CRF Model for Location Name Recognition Takuya Komada, Takashi Inui Department of Computer Science University of Tsukuba 1 komada@mibel.cs.tsukubai.ac.jp
  • 2. Background l Location Name l Location information is one of the essential components for some NLP applications. l E.g. location name disambiguation , mapping of location names to geographic locations. l Named Entity Recognition (NER) l Detect entities and classify each entity in pre- defined types. e.g., LOC, PER, ORG. l For example, Leading to [Tokyo LOC]. komada@mibel.cs.tsukubai.ac.jp 2
  • 3. Background l Multimodal NER l Deep learning models using visual information. l Extract named entities from image attached social media posts. (Twitter, SnapChat, etc) l Related works l Moon, Lu, Zhang Proposed a neural NER model using images attached to document. l Only use image attached to the document. komada@mibel.cs.tsukubai.ac.jp 3
  • 4. Background l Effectiveness of Images l Visual information can explain word meanings. l Skyscrapers in Fig. 1 l Townscapes surrounded by mountains and rivers in Fig. 2 l Shenzhen and Dubai have the same NE aspect and have similarities in their images komada@mibel.cs.tsukubai.ac.jp 4
  • 5. Research Objective l Image data corresponding to each word would provide rich information of word meanings. l Propose a method that utilizes images more effectively. l image data are obtained for each word. l Introduce a Gate mechanism l Control the extent to which the visual feature. komada@mibel.cs.tsukubai.ac.jp 5
  • 6. Proposed Model l (Baseline) Character-based BiLSTM-CRF Model komada@mibel.cs.tsukubai.ac.jp 6
  • 13. Experiments l Dataset l Extended Named Entity corpus l Images l Google Images with photo option l Queries are all nouns in documents. l Obtain top 15 images komada@mibel.cs.tsukubai.ac.jp 13
  • 14. Experiments l Settings l Baseline: No use of visual features. l Visual (Simple): Uses the element-wise visual features. l Visual (Gate): Uses element-wise visual features with the Gate mechanism. komada@mibel.cs.tsukubai.ac.jp 14
  • 15. Experimental Results l Results l Both models using element-wise visual features outperformed the baseline model. komada@mibel.cs.tsukubai.ac.jp 15
  • 16. Experimental Results l Examples l (ex.1-E) Signed by 117 countries and two regions at the Final Protocol and Convention Signing Conference in [Jamaica Country] in December 1982. l (ex.2-E) Finally, tomorrow is the last day of our stay in France, except for the day we leave. We re going to [Avignon City]. komada@mibel.cs.tsukubai.ac.jp 16
  • 17. Experimental Results l Unseen words l precision values improved most significantly (Seen: +5.08, Unseen: +7.07). l improve the performance of true-negatives. komada@mibel.cs.tsukubai.ac.jp 17
  • 18. Conclusions l Propose an element-wise visual-enhanced NER model l Element-wise visual feature l Image retrieval l Gate mechanism l Achieved a higher F1-value performance than the baseline model l Future research l Investigate of effectiveness in other NE classes. l Improve our model by conducting elaborate query investigations that are motivated by the error analysis. l Attempt queries with nouns and adjectives/verbs. komada@mibel.cs.tsukubai.ac.jp 18