EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation System
Hyun-Sik Won¹*, Su-Min Roh¹*, Dohyun Kim³, Min-Ji Kim¹, Huiju Kim³, and Kang-Min Kim¹,²
¹ Department of Artificial Intelligence, The Catholic University of Korea, Bucheon, South Korea
² Department of Data Science, The Catholic University of Korea, Bucheon, South Korea
³ Department of Computer Science and Engineering, Korea University, Seoul, South Korea
* Equal contribution
Background
• Microblogging Platforms
: Hashtags allow people to promote their own posts or categorize posts into specific topics.
Introduction
• Hashtag Recommendation Task
: Recommend appropriate hashtags related to the post.
• Previous Works
: Recommend hashtags by utilizing explicit information in the post (e.g., text and image).
: More recently, each user's hashtag habits [1] on the microblog have also been used to recommend personalized hashtags.
: However, these methods use only the information found in the microblog, so they may have difficulty understanding the contextual information of the post.
[1] Chen, Yu-Chi, et al. "Tagnet: triplet-attention graph networks for hashtag recommendation." IEEE Transactions on Circuits and Systems for Video Technology 32.3 (2021): 1148-1159.
• External Knowledge in Hashtag Recommendation
: A hashtag such as #dogtraining may not appear directly in the text or image of the post.
: However, #dogtraining can be predicted based on the external knowledge that dogs are typically trained.
: How can we integrate external knowledge to recommend more relevant hashtags for the post?
Related Work
• Open Directory Project (ODP)
: A comprehensive directory of the World Wide Web, constructed and maintained by a community of volunteer editors.
: It organizes web pages into the most closely related categories and subcategories (e.g., Health/Animal/Mammals/Dogs).
• Previous studies have used ODP categories as external knowledge.
: One study [1] trained a classifier to predict the category from the description of a web page.
[1] Kim et al., "meChat: In-device Personal Assistant for Conversational Photo Sharing," IEEE Internet Computing, 2019.
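An ODP category is a slash-separated path that runs from a top-level topic down to a subcategory. The following is a minimal sketch of handling such paths, assuming plain string splitting is sufficient; the helper names `split_category` and `ancestors` are illustrative and do not come from the presentation.

```python
def split_category(path: str) -> list[str]:
    """Split an ODP-style category path into its levels."""
    return [level for level in path.strip("/").split("/") if level]


def ancestors(path: str) -> list[str]:
    """Return every prefix of the path, from the top-level category down."""
    levels = split_category(path)
    return ["/".join(levels[: i + 1]) for i in range(len(levels))]


# Example path taken from the slide above.
example = "Health/Animal/Mammals/Dogs"
levels = split_category(example)
prefixes = ancestors(example)
print(levels)
print(prefixes)
```

Keeping the full list of prefixes makes it easy to truncate a category to a chosen depth when a coarser label is enough.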
Methodology
• Overview of the proposed model
① External Knowledge Feature Extraction Model
② Integrating External Knowledge into Multimodal Hashtag Recommendation System (EXTRA)
① External Knowledge Feature Extraction Model
• Caption Generator
: Generates a caption for the image of the post (e.g., "A woman is holding a certificate with a dog").
• ODP-based Classifier (ODP-I)
: Predicts the most relevant ODP categories from the caption:
- Health/Animal/Mammals/Dogs
- Recreation/Pets/Dogs/Training
- Recreation/Pets/Dogs/Breeds/Working_Group
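The caption-to-category step can be sketched as a ranking problem: score each category against the caption and keep the top few. The slides describe a classifier trained on ODP web page descriptions; the sketch below swaps in a simple bag-of-words cosine similarity instead, and the category descriptions in `TOY_DESCRIPTIONS` are hand-written placeholders rather than real ODP data.

```python
import math
from collections import Counter

# Hypothetical, hand-written descriptions for illustration only.
# A real system would learn from ODP web page descriptions.
TOY_DESCRIPTIONS = {
    "Health/Animal/Mammals/Dogs": "dog health veterinary care for dogs",
    "Recreation/Pets/Dogs/Training": "dog training obedience classes certificate",
    "Recreation/Pets/Dogs/Breeds/Working_Group": "working dog breeds",
    "Sports/Cycling": "bicycle racing and cycling routes",
}


def bag_of_words(text: str) -> Counter:
    """Lowercase the text and count whitespace-separated tokens."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_categories(caption: str, k: int = 3) -> list[str]:
    """Rank the toy categories by similarity to the caption and keep k."""
    query = bag_of_words(caption)
    scored = [
        (cosine(query, bag_of_words(description)), category)
        for category, description in TOY_DESCRIPTIONS.items()
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [category for _, category in scored[:k]]


caption = "A woman is holding a certificate with a dog"
predicted = top_categories(caption)
print(predicted)
```

With these toy descriptions the unrelated cycling category falls outside the top three, which mirrors how the categories act as a filter on what the image is about.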
② Integrating External Knowledge into Multimodal Hashtag Recommendation System (EXTRA)
• Inputs
- Image of the post
- Text of the post: "Algy is our other graduate. Today, Algy aced the see-saw and weave poles so impressively with this obstacle awareness.."
- Relevant categories of the image: Health/Animal/Mammals/Dogs, Recreation/Pets/Dogs/Training, Recreation/Pets/Dogs/Breeds/Working_Group
• Encoders
- Image Encoder
- Text Encoder
- Multimodal Encoder
• Tokens shown in the figure: Dogs, Training, Working Group, algy, is, our, other, …, obstacle, awareness
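The figure shows the terms "Dogs", "Training", and "Working Group" next to the post's text tokens. One plausible reading is that the last level of each category path is turned into a term and combined with the post text. The sketch below illustrates that reading only; both the use of the last level and the simple concatenation are assumptions, and `leaf_term` and `build_text_input` are hypothetical helper names.

```python
def leaf_term(category: str) -> str:
    """Return the last level of a category path, with underscores as spaces."""
    return category.rstrip("/").split("/")[-1].replace("_", " ")


def build_text_input(post_text: str, categories: list[str]) -> str:
    """Prepend de-duplicated category terms to the post text (assumed scheme)."""
    terms: list[str] = []
    for category in categories:
        term = leaf_term(category)
        if term not in terms:
            terms.append(term)
    return " ".join(terms + [post_text])


categories = [
    "Health/Animal/Mammals/Dogs",
    "Recreation/Pets/Dogs/Training",
    "Recreation/Pets/Dogs/Breeds/Working_Group",
]
post = "Algy is our other graduate."
combined = build_text_input(post, categories)
print(combined)
```

The combined string could then be tokenized like any other text input, so no change to the encoder interface would be needed under this assumption.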
Dataset
• Open Directory Project Dataset (for extracting external knowledge)
- # Web pages: 60,710
- # Categories: 2,531
• MaCon Dataset (for hashtag recommendation)
- # Posts: 624,520
- # Hashtags: 3,896
- Average # hashtags per post: 9.3
Baselines
• Co-Attention (Co-AT)
: A model that sequentially generates text and image feature vectors from a post using a co-attention mechanism.
• MaCon
: A model that applies a parallel co-attention mechanism to combine the text and image features with users' tagging habits.
• TAGNet (state of the art)
: A model that integrates text, visual, and user habit features through a triplet attention module by constructing visual similarity graphs.
Settings
• Implementation Details
- Model: Pre-trained FLAVA [1]
- Optimizer: AdamW
- Learning Rate: 1e-5 (10% warmup, cosine decay)
- Batch Size: 64
• Evaluation Metric
: Top-k Precision, Recall, F1-score
[1] Singh, Amanpreet, et al. "Flava: A foundational language and vision alignment model." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.
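The learning-rate setting (peak 1e-5, 10% warmup, cosine decay) can be sketched as a function of the training step. The slides do not say whether the warmup is linear or what the final rate is; the sketch assumes a linear warmup from zero and a decay to zero, and `learning_rate` is an illustrative name.

```python
import math


def learning_rate(
    step: int,
    total_steps: int,
    peak_lr: float = 1e-5,
    warmup_ratio: float = 0.1,
) -> float:
    """Linear warmup to peak_lr, then cosine decay to zero (assumed shape)."""
    warmup_steps = int(total_steps * warmup_ratio)
    if warmup_steps > 0 and step < warmup_steps:
        # Linear ramp from 0 up to the peak over the warmup phase.
        return peak_lr * step / warmup_steps
    # Fraction of the post-warmup phase that has elapsed, clipped to [0, 1].
    remaining = max(1, total_steps - warmup_steps)
    progress = min(1.0, (step - warmup_steps) / remaining)
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * progress))


total = 1000
schedule = [learning_rate(s, total) for s in (0, 50, 100, 550, 1000)]
print(schedule)
```

In practice the same shape would usually come from a scheduler attached to the AdamW optimizer; the standalone function is only meant to make the curve explicit.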
Experiments
• Comparison with Baselines
(Figure: bar chart of F1-score at Top-1, Top-5, and Top-9 for Co-AT, MaCon, TAGNet, and EXTRA (Ours); y-axis from 0 to 0.45.)
: EXTRA achieved a 39.78% improvement over the state-of-the-art model TAGNet in terms of the Top-5 F1-score.
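The Top-k precision, recall, and F1-score used in this comparison can be computed for a single post as sketched below. How the scores are averaged over posts is not stated in the slides, so the sketch stops at the per-post values; the function name and the example hashtags are hypothetical.

```python
def top_k_metrics(
    ranked: list[str], gold: set[str], k: int
) -> tuple[float, float, float]:
    """Precision, recall, and F1 of the top-k ranked hashtags for one post."""
    top_k = ranked[:k]
    hits = sum(1 for tag in top_k if tag in gold)
    precision = hits / k if k else 0.0
    recall = hits / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical model ranking and ground-truth hashtags for one post.
ranked = ["#dogtraining", "#dog", "#cat", "#agility", "#travel"]
gold = {"#dogtraining", "#agility", "#puppy"}
precision, recall, f1 = top_k_metrics(ranked, gold, k=5)
print(precision, recall, f1)
```

Here two of the five recommended hashtags are correct and two of the three ground-truth hashtags are recovered, so precision and recall differ and F1 balances the two.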
• Comparison Between External Knowledge From Each Modality
: The models that integrate external knowledge perform better than those that do not.
Conclusions
• We propose EXTRA, a novel multimodal hashtag recommendation system that combines text and image with external knowledge in a Transformer-based architecture.
• We employ the most relevant categories extracted from the image of the post as external knowledge using the ODP-based classifier.
• Our approach outperformed the existing state-of-the-art methods on the MaCon dataset, demonstrating its effectiveness in handling multimodal information for the hashtag recommendation task.
Experiments
• Illustrations of Recommended Hashtags
- Predicted hashtags (w/o ODP-I): #lax #makeportraits #mountainlife #plasticpollution #pursuitofportraits #spa #greenery #travel #detail #blackandwhite
- Predicted hashtags (w/ ODP-I): #makeportraits #plasticpollution #mountainlife #adventure #lax #pursuitofportraits #travel #greatnorthcollective #nature
• Comparison between Each Modality
• Comparison between the Number of Subcategories

More Related Content

What's hot

Recurrent neural networks rnn
Recurrent neural networks   rnnRecurrent neural networks   rnn
Recurrent neural networks rnnKuppusamy P
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithmJaved Khan
 
Challenges in nlp
Challenges in nlpChallenges in nlp
Challenges in nlpZareen Syed
 
Python vs MATLAB: Which one is the best language
Python vs MATLAB:  Which one is the best languagePython vs MATLAB:  Which one is the best language
Python vs MATLAB: Which one is the best languageStat Analytica
 
TEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELS
TEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELSTEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELS
TEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELSPrince Kumar
 
A product-focused introduction to Machine Learning
A product-focused introduction to Machine LearningA product-focused introduction to Machine Learning
A product-focused introduction to Machine LearningSatpreet Singh
 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesKarol Chlasta
 
LSTM Based Sentiment Analysis
LSTM Based Sentiment AnalysisLSTM Based Sentiment Analysis
LSTM Based Sentiment Analysisijtsrd
 
Approaching (almost) Any NLP Problem
Approaching (almost) Any NLP ProblemApproaching (almost) Any NLP Problem
Approaching (almost) Any NLP ProblemAbhishek Thakur
 
Interfacing stepper motor
Interfacing stepper motorInterfacing stepper motor
Interfacing stepper motorPRADEEP
 

What's hot (12)

Recurrent neural networks rnn
Recurrent neural networks   rnnRecurrent neural networks   rnn
Recurrent neural networks rnn
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithm
 
Challenges in nlp
Challenges in nlpChallenges in nlp
Challenges in nlp
 
Python vs MATLAB: Which one is the best language
Python vs MATLAB:  Which one is the best languagePython vs MATLAB:  Which one is the best language
Python vs MATLAB: Which one is the best language
 
TEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELS
TEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELSTEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELS
TEXT BASED PERSONALITY RESOLUTION USING MACHINE LEARNING MODELS
 
A product-focused introduction to Machine Learning
A product-focused introduction to Machine LearningA product-focused introduction to Machine Learning
A product-focused introduction to Machine Learning
 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use cases
 
LSTM Based Sentiment Analysis
LSTM Based Sentiment AnalysisLSTM Based Sentiment Analysis
LSTM Based Sentiment Analysis
 
Twitter Analytics
Twitter AnalyticsTwitter Analytics
Twitter Analytics
 
Self Organizing Maps
Self Organizing MapsSelf Organizing Maps
Self Organizing Maps
 
Approaching (almost) Any NLP Problem
Approaching (almost) Any NLP ProblemApproaching (almost) Any NLP Problem
Approaching (almost) Any NLP Problem
 
Interfacing stepper motor
Interfacing stepper motorInterfacing stepper motor
Interfacing stepper motor
 

Similar to EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation System

Teacher training material
Teacher training materialTeacher training material
Teacher training materialVikram Parmar
 
From Chalkboards to Chatbots: A Deep Dive into AI for Education Workshop
From Chalkboards to Chatbots: A Deep Dive into AI for Education WorkshopFrom Chalkboards to Chatbots: A Deep Dive into AI for Education Workshop
From Chalkboards to Chatbots: A Deep Dive into AI for Education WorkshopStella Lee
 
Andrew NG machine learning
Andrew NG machine learningAndrew NG machine learning
Andrew NG machine learningShareDocView.com
 
Interactive Machine Learning Appendix
Interactive  Machine Learning AppendixInteractive  Machine Learning Appendix
Interactive Machine Learning AppendixZitao Liu
 
2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...Ed Chi
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSumit Raj
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE
 
Leveraging social media for training object detectors
Leveraging social media for training object detectorsLeveraging social media for training object detectors
Leveraging social media for training object detectorsManish Kumar
 
CodeLess Machine Learning
CodeLess Machine LearningCodeLess Machine Learning
CodeLess Machine LearningSharjeel Imtiaz
 
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...Egyptian Engineers Association
 
Machine Learning ass. of tanumalakar.pdf
Machine Learning ass. of tanumalakar.pdfMachine Learning ass. of tanumalakar.pdf
Machine Learning ass. of tanumalakar.pdfDYDF
 
Essential concepts for machine learning
Essential concepts for machine learning Essential concepts for machine learning
Essential concepts for machine learning pyingkodi maran
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellSri Ambati
 
AI AND EDUCATIO
AI AND EDUCATIOAI AND EDUCATIO
AI AND EDUCATIOno0orcom
 
Introduction to Recommender System
Introduction to Recommender SystemIntroduction to Recommender System
Introduction to Recommender SystemWQ Fan
 
Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...
Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...
Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...Alok Singh
 

Similar to EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation System (20)

Teacher training material
Teacher training materialTeacher training material
Teacher training material
 
From Chalkboards to Chatbots: A Deep Dive into AI for Education Workshop
From Chalkboards to Chatbots: A Deep Dive into AI for Education WorkshopFrom Chalkboards to Chatbots: A Deep Dive into AI for Education Workshop
From Chalkboards to Chatbots: A Deep Dive into AI for Education Workshop
 
ODSC APAC 2022 - Explainable AI
ODSC APAC 2022 - Explainable AIODSC APAC 2022 - Explainable AI
ODSC APAC 2022 - Explainable AI
 
Andrew NG machine learning
Andrew NG machine learningAndrew NG machine learning
Andrew NG machine learning
 
Interactive Machine Learning Appendix
Interactive  Machine Learning AppendixInteractive  Machine Learning Appendix
Interactive Machine Learning Appendix
 
2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
 
Leveraging social media for training object detectors
Leveraging social media for training object detectorsLeveraging social media for training object detectors
Leveraging social media for training object detectors
 
CodeLess Machine Learning
CodeLess Machine LearningCodeLess Machine Learning
CodeLess Machine Learning
 
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
 
Machine Learning ass. of tanumalakar.pdf
Machine Learning ass. of tanumalakar.pdfMachine Learning ass. of tanumalakar.pdf
Machine Learning ass. of tanumalakar.pdf
 
Essential concepts for machine learning
Essential concepts for machine learning Essential concepts for machine learning
Essential concepts for machine learning
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin Ledell
 
AI AND EDUCATIO
AI AND EDUCATIOAI AND EDUCATIO
AI AND EDUCATIO
 
Data-X-Sparse-v2
Data-X-Sparse-v2Data-X-Sparse-v2
Data-X-Sparse-v2
 
Introduction to Recommender System
Introduction to Recommender SystemIntroduction to Recommender System
Introduction to Recommender System
 
Active learning
Active learningActive learning
Active learning
 
Recsys 2016
Recsys 2016Recsys 2016
Recsys 2016
 
Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...
Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...
Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat...
 

Recently uploaded

Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...NETWAYS
 
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Kayode Fayemi
 
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Salam Al-Karadaghi
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...NETWAYS
 
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...NETWAYS
 
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxGenesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxFamilyWorshipCenterD
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024eCommerce Institute
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Krijn Poppe
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesPooja Nehwal
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfakankshagupta7348026
 
SaaStr Workshop Wednesday w: Jason Lemkin, SaaStr
SaaStr Workshop Wednesday w: Jason Lemkin, SaaStrSaaStr Workshop Wednesday w: Jason Lemkin, SaaStr
SaaStr Workshop Wednesday w: Jason Lemkin, SaaStrsaastr
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@vikas rana
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
Philippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptPhilippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptssuser319dad
 
LANDMARKS AND MONUMENTS IN NIGERIA.pptx
LANDMARKS  AND MONUMENTS IN NIGERIA.pptxLANDMARKS  AND MONUMENTS IN NIGERIA.pptx
LANDMARKS AND MONUMENTS IN NIGERIA.pptxBasil Achie
 
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...NETWAYS
 

Recently uploaded (20)

Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
 
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
 
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
 
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxGenesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdf
 
SaaStr Workshop Wednesday w: Jason Lemkin, SaaStr
SaaStr Workshop Wednesday w: Jason Lemkin, SaaStrSaaStr Workshop Wednesday w: Jason Lemkin, SaaStr
SaaStr Workshop Wednesday w: Jason Lemkin, SaaStr
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
Philippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptPhilippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.ppt
 
LANDMARKS AND MONUMENTS IN NIGERIA.pptx
LANDMARKS  AND MONUMENTS IN NIGERIA.pptxLANDMARKS  AND MONUMENTS IN NIGERIA.pptx
LANDMARKS AND MONUMENTS IN NIGERIA.pptx
 
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
 

EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation System

  • 1. 1 EXTRA : Integrating External Knowledge into Multimodal Hashtag Recommendation System 1 Department of Artificial Intelligence, The Catholic University of Korea, Bucheon, South Korea 2 Department of Data Science, The Catholic University of Korea, Bucheon, South Korea 3 Department of Computer Science and Engineering, Korea University, Seoul, South Korea Hyun-Sik Won1∗, Su-Min Roh1∗, Dohyun Kim3, Min-Ji Kim1, Huiju Kim3, and Kang-Min Kim12 * Equal contribution
  • 2. Background 2 • Microblogging Platforms Hashtags allow people to promote their own posts or categorize posts into specific topics.
  • 3. Introduction • Hashtag Recommendation Task : Recommend appropriate hashtags related to the post. • Previous Works : Recommend hashtags by utilizing explicit information in the post (i.e., text, image, etc.). : Recently, each user’s hashtag habits1 from the microblog are also utilized to recommend personalized hashtags. 3 1. Chen, Yu-Chi, et al. "Tagnet: triplet-attention graph networks for hashtag recommendation." IEEE Transactions on Circuits and Systems for Video Technology 32.3 (2021): 1148-1159.
  • 4. Introduction • Hashtag Recommendation Task : Recommend appropriate hashtags related to the post. • Previous Works : Recommend hashtags by utilizing explicit information in the post (i.e., text, image, etc.). : Recently, each user’s hashtag habits1 from the microblog are also utilized to recommend personalized hashtags. However, these methods only utilize the information found in the microblog.  This method may have difficulty understanding the contextual information of the post. 4 1. Chen, Yu-Chi, et al. "Tagnet: triplet-attention graph networks for hashtag recommendation." IEEE Transactions on Circuits and Systems for Video Technology 32.3 (2021): 1148-1159.
  • 5. Introduction • External Knowledge in Hashtag Recommendation 5
  • 6. Introduction 6 • External Knowledge in Hashtag Recommendation Does not appear directly in the text or image.
  • 7. Introduction 7 However, #dogtraining can be predicted based on external knowledge that dogs are typically trained. • External Knowledge in Hashtag Recommendation Does not appear directly in the text or image.
  • 8. Introduction 8 However, #dogtraining can be predicted based on external knowledge that dogs are typically trained. • External Knowledge in Hashtag Recommendation Does not appear directly in the text or image. How can we integrate the external knowledge to recommend more relevant hashtags for the post?
  • 9. Related Work • Open Directory Project (ODP) : This is a comprehensive directory of the World Wide Web, constructed and maintained by a community of volunteer editors. : It organizes web pages into the most related categories and subcategories. (e.g., Health/Animal/Mammals/Dogs) • There have been previous studies utilizing ODP categories as an external knowledge. : The study1 trained a classifier to predict the category based on the description of the webpage. 9 1. Kim et al., “meChat: In-device Personal Assistant for Conversational Photo Sharing” IEEE Internet Computing 2019
  • 12. Methodology • Overview of the proposed model: ① External Knowledge Feature Extraction Model ② Integrating External Knowledge into Multimodal Hashtag Recommendation System (EXTRA)
  • 16. Methodology ① External Knowledge Feature Extraction Model • Caption Generator output: "A woman is holding a certificate with a dog" • ODP-based Classifier (ODP-I) output: Health/Animal/Mammals/Dogs; Recreation/Pets/Dogs/Training; Recreation/Pets/Dogs/Breeds/Working_Group
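The two-stage flow on this slide (a caption is generated first, then categories are predicted from the caption) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' model: the caption generator is stubbed with the slide's example caption, the trained ODP-based classifier is swapped for a simple bag-of-words cosine-similarity ranker, and `TOY_PAGES`, `generate_caption`, and the `Home/Cooking/Baking` category are hypothetical.

```python
import math
from collections import Counter

# Hypothetical (description, category) pairs standing in for ODP web pages.
TOY_PAGES = [
    ("dog health care and veterinary advice for mammals",
     "Health/Animal/Mammals/Dogs"),
    ("obedience classes and certificate courses for training your dog",
     "Recreation/Pets/Dogs/Training"),
    ("recipes for baking bread and cakes at home",
     "Home/Cooking/Baking"),  # hypothetical category for contrast
]


def bag_of_words(text):
    """Lowercase, whitespace-tokenized word counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_centroids(pages):
    """Sum the word counts of all descriptions belonging to each category."""
    centroids = {}
    for description, category in pages:
        centroids.setdefault(category, Counter()).update(bag_of_words(description))
    return centroids


def generate_caption(image_path):
    """Stub for a caption generator; returns the slide's example caption."""
    return "A woman is holding a certificate with a dog"


def top_k_categories(caption, centroids, k=3):
    """Rank categories by similarity to the caption and keep the best k."""
    query = bag_of_words(caption)
    ranked = sorted(centroids, key=lambda c: cosine(query, centroids[c]), reverse=True)
    return ranked[:k]


centroids = build_centroids(TOY_PAGES)
caption = generate_caption("post.jpg")  # the path is a placeholder
print(top_k_categories(caption, centroids, k=2))
```

The ranked category paths are what would be passed on to the recommendation model as external knowledge; a real system would replace both the stub and the similarity ranker with trained models.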
  • 20. Methodology ② Integrating External Knowledge into Multimodal Hashtag Recommendation System (EXTRA) • Image of the post • Text of the post: "Algy is our other graduate. Today, Algy aced the see-saw and weave poles so impressively with this obstacle awareness.." • Relevant Category of the Image: Health/Animal/Mammals/Dogs; Recreation/Pets/Dogs/Training; Recreation/Pets/Dogs/Breeds/Working_Group • Tokens shown in the figure: Dogs, Training, Working Group; algy, is, our, other, …, obstacle, awareness • Components: Image Encoder, Text Encoder, Multimodal Encoder
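The figure's tokens ("Dogs Training Working Group" alongside the words of the post) suggest that the leaf name of each category path is fed to the text side together with the post text. The sketch below illustrates that reading only; the concatenation order, the separator, and the helper names `category_leaf_words` and `build_text_input` are assumptions, not details confirmed by the slides.

```python
def category_leaf_words(path):
    """Return the last component of an ODP path, with underscores as spaces."""
    leaf = path.rstrip("/").split("/")[-1]
    return leaf.replace("_", " ")


def build_text_input(categories, post_text, sep=" "):
    """Prepend category leaf names to the post text (assumed input layout)."""
    leaves = [category_leaf_words(c) for c in categories]
    return sep.join(leaves + [post_text])


categories = [
    "Health/Animal/Mammals/Dogs",
    "Recreation/Pets/Dogs/Training",
    "Recreation/Pets/Dogs/Breeds/Working_Group",
]
post_text = "Algy is our other graduate."
print(build_text_input(categories, post_text))
```

The resulting string would then be tokenized by whatever tokenizer the text encoder uses; that step is omitted here.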
  • 21. Dataset • Open Directory Project Dataset (for extracting external knowledge): 60,710 web pages; 2,531 categories • MaCon Dataset (for hashtag recommendation): 624,520 posts; 3,896 hashtags; 9.3 hashtags per post on average
  • 22. Baselines • Co-Attention (Co-AT): a model that sequentially generates text and image feature vectors from a post using a co-attention mechanism. • MaCon: a model that applies a parallel co-attention mechanism to combine the text and image features with users' tagging habits. • TAGNet (state of the art): a model that constructs visual similarity graphs and integrates text, visual, and user habit features through a triplet attention module.
  • 23. Settings • Implementation Details - Model: pre-trained FLAVA [1] - Optimizer: AdamW - Learning rate: 1e-5 (10% warmup, cosine decay) - Batch size: 64 • Evaluation Metrics: Top-k Precision, Recall, and F1-score [1] Singh, Amanpreet, et al. "Flava: A foundational language and vision alignment model." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.
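The learning-rate setting above (peak 1e-5, 10% warmup, cosine decay) could look roughly like the schedule below. The slide does not give the warmup shape, the final learning rate, or the total number of steps, so the linear warmup, the decay to zero, and `total_steps=1000` are assumptions for illustration.

```python
import math


def lr_at_step(step, total_steps, peak_lr=1e-5, warmup_frac=0.10):
    """Linear warmup to peak_lr, then cosine decay to zero (assumed shape)."""
    warmup_steps = max(1, int(total_steps * warmup_frac))
    if step < warmup_steps:
        # Ramp linearly so the last warmup step reaches peak_lr.
        return peak_lr * (step + 1) / warmup_steps
    # Fraction of the decay phase completed, from 0.0 to 1.0.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


total_steps = 1000  # hypothetical training length
for step in (0, 99, 100, 550, 1000):
    print(step, lr_at_step(step, total_steps))
```

In practice this function would typically be wrapped in the training framework's scheduler interface rather than called by hand.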
  • 24. Experiments • Comparison with Baselines [Bar chart: F1-score at Top-1, Top-5, and Top-9 for Co-AT, MaCon, TAGNet, and EXTRA (Ours)] → EXTRA achieved a 39.78% improvement over the state-of-the-art model TAGNet in terms of the Top-5 F1-score.
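For reference, Top-k precision, recall, and F1 for a single post are commonly computed as below. This uses the standard definitions; how the authors average the scores across posts is not stated in the slides, and the hashtags in the example are made up for illustration.

```python
def top_k_scores(ranked_predictions, true_hashtags, k):
    """Return (precision, recall, F1) of the top-k predictions for one post."""
    top_k = ranked_predictions[:k]
    truth = set(true_hashtags)
    hits = sum(1 for tag in top_k if tag in truth)
    precision = hits / k if k else 0.0
    recall = hits / len(truth) if truth else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical ranked output and ground truth for one post.
predicted = ["#dog", "#dogtraining", "#cat", "#travel", "#puppy"]
actual = ["#dogtraining", "#puppy", "#agility"]
print(top_k_scores(predicted, actual, k=5))
```

With two of the five predictions correct and three true hashtags, precision is 2/5 and recall is 2/3, which gives an F1 of 0.5.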
  • 25. Experiments • Comparison Between External Knowledge From Each Modality → The models that integrate external knowledge perform better than those that do not.
  • 26. Conclusions • We propose EXTRA, a novel multimodal hashtag recommendation system that combines text and image with external knowledge in a Transformer-based architecture. • We employ the most relevant categories extracted from the image of the post as external knowledge using the ODP-based classifier. • Our approach outperformed the existing state-of-the-art methods on the MaCon dataset, demonstrating its effectiveness in handling multimodal information for the hashtag recommendation task.
  • 28. Experiments • Illustrations of Recommended Hashtags • Predicted hashtags (w/o ODP-I): #lax #makeportraits #mountainlife #plasticpollution #pursuitofportraits #spa #greenery #travel #detail #blackandwhite • Predicted hashtags (w/ ODP-I): #makeportraits #plasticpollution #mountainlife #adventure #lax #pursuitofportraits #travel #greatnorthcollective #nature
  • 30. Experiments • Comparison by Number of Subcategories