SlideShare a Scribd company logo
Netizen-Style Commenting on
Fashion Photos: Dataset and
Diversity Measures
TheWebConf (WWW) 2018
Abstract
• Sentences generated by current works describe shallow appearances
and are boring.
• Netizen Style Commenting automatically generate characteristic
comments to a user-contributed fashion photo.
• Three major component:
• Construct a large-scale clothing dataset
• Marry topic models with neural networks
• Propose three unique measures to estimate the diversity of comments
• Improve accuracy and diversity
Outline
1. Introduction
2. Related work
3. Dataset - Netilook
4. Method - Netizen Style Commenting
5. Diversity measures
6. Experiment
7. Conclusion
Introduction
• Modern model can achieve good scores in machine translation
metrics but are short of humanity.
• Collect a large corpus of paired user-contributed fashion photos and
comments, called NetiLook
• Existing models may overfit the dataset and generate comment like
“love the ….”.
• Integrate latent topic models with state-of-the-art methods and
make the generated sentence vivacious.
• Propose performance measurement for diversity.
Introduction (cont.)
Microsoft CaptionBot Netizen
Introduction (cont.)
Related work
• Image caption help visually impaired
users and human-robot interaction.
• State-of-the-art model are majorly
attention-based models because they
focus on correctness of description.
• Compared with depicting images,
giving comments is more challenging
because it needs to not only
understand images but take care of
engagement with users.
(Jonghwan Mun , AAAI 2017)
Dataset - Netilook
• Collect photos and comments from
Lookbook to construct NetiLook.
Method - Netizen Style Commenting
• Some frequently used sentences along with posts (e.g., “love this!”,
“nice”) which cause current models inclined to generate similar
sentences.
Method - Netizen Style Commenting (cont.)
• Introduce style-weight wstyle element-wised multiplied (◦) with
outputs at each step of LSTM to season generated sentences.
• Style-weight wstyle represents the comment style, which teaches
models to be acquainted with style in the corpus while generating
captioning.
Method - Netizen Style Commenting (cont.)
• Abstract concepts are hard for people to give a specific
definition.
• Apply LDA to discover latent topics and fuse with current models.
• LDA:
• Topic-word vectors:
• Comment-topic vectors:
• N: word dictionary
• z: topics
• m: comments
Method - Netizen Style Commenting (cont.)
• To find the topic distribution in corpus, each comment votes the
topic with highest probability by .
• The voting gives the most characteristic style in the corpus:
• The topic distribution of the corpus:
Method - Netizen Style Commenting (cont.)
• With the topic distribution of corpus y and topic-word vectors ϕ,
our style-weight wstyle is now defined as:
where yk means the k-th dimension of y
Diversity measures
• BLEU and METEOR are not for diversity measure, diversity measures
are being put importance on sentence generation models.
• More diverse sentences are generated, more unique words are
used.
• DicRate: ratio of unique words in ground truth and generations.
Diversity measures (cont.)
• WF-KL: The KL divergence of word frequency distribution.
• Frequency distribution:
• KL:
Diversity measures (cont.)
• POS-KL: The KL divergence of part-of-speech (POS) distribution.
• Frequency distribution:
• KL:
Experiment
• Setting: Beam size= 3; k= 3 or 5
• Topic models would not benefit the attention-based approach for
the reason that attention-based models are greatly restricted the
word selection.
Experiment (cont.)
• For a comment given by a human or machine, it is difficult to be
evaluated on conventional measures such as BLEU in NetiLook.
• Netilook has much more diversity and unique words than other
datasets.
Experiment (cont.)
• There are some common words and general patterns to describe
and comment on the clothing style in comparison with Flickr30k.
• In NetiLook, the experiment in Table 3 shows that our method can
greatly improve the diversity.
Experiment (cont.)
Experiment (cont.)
• User study:
• about 25 year-old and familiar with netizen style community
• 2.83 males/female
Conclusion
• Style-weight that greatly influences on current captioning models to
immerse into human online society.
• Proposed approaches benefit fashion photo commenting and
improve image captioning task.
• The approach could be applied on other fields to help generate
sentences with various styles by the idea of style-weight.

More Related Content

Similar to Netizen style commenting on fashion photos

What is word2vec?
What is word2vec?What is word2vec?
What is word2vec?
Traian Rebedea
 
Domain Modeling for Personalized Learning
Domain Modeling for Personalized LearningDomain Modeling for Personalized Learning
Domain Modeling for Personalized Learning
Peter Brusilovsky
 
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Lucidworks
 
Word 2 vector
Word 2 vectorWord 2 vector
Word 2 vector
Ashis Kumar Chanda
 
Interactive Machine Learning Appendix
Interactive  Machine Learning AppendixInteractive  Machine Learning Appendix
Interactive Machine Learning Appendix
Zitao Liu
 
Training Researchers with the MOVING Platform
Training Researchers with the MOVING PlatformTraining Researchers with the MOVING Platform
Training Researchers with the MOVING Platform
Iacopo Vagliano
 
1.2 Motivating Challenges As mentioned earlier, traditional data
1.2 Motivating Challenges As mentioned earlier, traditional data1.2 Motivating Challenges As mentioned earlier, traditional data
1.2 Motivating Challenges As mentioned earlier, traditional data
SantosConleyha
 
CIS 111 Focus Dreams/newtonhelp.com
CIS 111 Focus Dreams/newtonhelp.comCIS 111 Focus Dreams/newtonhelp.com
CIS 111 Focus Dreams/newtonhelp.com
bellflower82
 
ONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESS
Kishan Patel
 
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Matthew Lease
 
Slides ecir2016
Slides ecir2016Slides ecir2016
Slides ecir2016
Fattane Zarrinkalam
 
Low Cost Business Intelligence Platform for MongoDB instances using MEAN stack
Low Cost Business Intelligence Platform for MongoDB instances using MEAN stackLow Cost Business Intelligence Platform for MongoDB instances using MEAN stack
Low Cost Business Intelligence Platform for MongoDB instances using MEAN stack
Avinash Kaza
 
Automatic Summarizaton Tutorial
Automatic Summarizaton TutorialAutomatic Summarizaton Tutorial
Automatic Summarizaton Tutorial
Shilpa Subrahmanyam
 
010821+presentation+oti.ppt
010821+presentation+oti.ppt010821+presentation+oti.ppt
010821+presentation+oti.ppt
Yann-Gaël Guéhéneuc
 
Machine_learning_presentation_on_movie_recomendation_system.pptx
Machine_learning_presentation_on_movie_recomendation_system.pptxMachine_learning_presentation_on_movie_recomendation_system.pptx
Machine_learning_presentation_on_movie_recomendation_system.pptx
arunchoubeybxr
 
Tomáš Mikolov - Distributed Representations for NLP
Tomáš Mikolov - Distributed Representations for NLPTomáš Mikolov - Distributed Representations for NLP
Tomáš Mikolov - Distributed Representations for NLP
Machine Learning Prague
 
artrec.pptx
artrec.pptxartrec.pptx
artrec.pptx
AuraHub
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
Rinke Hoekstra
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
NBER
 

Similar to Netizen style commenting on fashion photos (20)

What is word2vec?
What is word2vec?What is word2vec?
What is word2vec?
 
Domain Modeling for Personalized Learning
Domain Modeling for Personalized LearningDomain Modeling for Personalized Learning
Domain Modeling for Personalized Learning
 
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
 
Word 2 vector
Word 2 vectorWord 2 vector
Word 2 vector
 
Interactive Machine Learning Appendix
Interactive  Machine Learning AppendixInteractive  Machine Learning Appendix
Interactive Machine Learning Appendix
 
Training Researchers with the MOVING Platform
Training Researchers with the MOVING PlatformTraining Researchers with the MOVING Platform
Training Researchers with the MOVING Platform
 
1.2 Motivating Challenges As mentioned earlier, traditional data
1.2 Motivating Challenges As mentioned earlier, traditional data1.2 Motivating Challenges As mentioned earlier, traditional data
1.2 Motivating Challenges As mentioned earlier, traditional data
 
CIS 111 Focus Dreams/newtonhelp.com
CIS 111 Focus Dreams/newtonhelp.comCIS 111 Focus Dreams/newtonhelp.com
CIS 111 Focus Dreams/newtonhelp.com
 
ONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESS
 
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
 
Slides ecir2016
Slides ecir2016Slides ecir2016
Slides ecir2016
 
Low Cost Business Intelligence Platform for MongoDB instances using MEAN stack
Low Cost Business Intelligence Platform for MongoDB instances using MEAN stackLow Cost Business Intelligence Platform for MongoDB instances using MEAN stack
Low Cost Business Intelligence Platform for MongoDB instances using MEAN stack
 
Automatic Summarizaton Tutorial
Automatic Summarizaton TutorialAutomatic Summarizaton Tutorial
Automatic Summarizaton Tutorial
 
010821+presentation+oti.ppt
010821+presentation+oti.ppt010821+presentation+oti.ppt
010821+presentation+oti.ppt
 
Machine_learning_presentation_on_movie_recomendation_system.pptx
Machine_learning_presentation_on_movie_recomendation_system.pptxMachine_learning_presentation_on_movie_recomendation_system.pptx
Machine_learning_presentation_on_movie_recomendation_system.pptx
 
Tomáš Mikolov - Distributed Representations for NLP
Tomáš Mikolov - Distributed Representations for NLPTomáš Mikolov - Distributed Representations for NLP
Tomáš Mikolov - Distributed Representations for NLP
 
Facs online tutorial
Facs online tutorialFacs online tutorial
Facs online tutorial
 
artrec.pptx
artrec.pptxartrec.pptx
artrec.pptx
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
 

Recently uploaded

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 

Recently uploaded (20)

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 

Netizen style commenting on fashion photos

  • 1. Netizen-Style Commenting on Fashion Photos: Dataset and Diversity Measures TheWebConf (WWW) 2018
  • 2. Abstract • Sentences generated by current works describe shallow appearances and are boring. • Netizen Style Commenting automatically generate characteristic comments to a user-contributed fashion photo. • Three major component: • Construct a large-scale clothing dataset • Marry topic models with neural networks • Propose three unique measures to estimate the diversity of comments • Improve accuracy and diversity
  • 3. Outline 1. Introduction 2. Related work 3. Dataset - Netilook 4. Method - Netizen Style Commenting 5. Diversity measures 6. Experiment 7. Conclusion
  • 4. Introduction • Modern model can achieve good scores in machine translation metrics but are short of humanity. • Collect a large corpus of paired user-contributed fashion photos and comments, called NetiLook • Existing models may overfit the dataset and generate comment like “love the ….”. • Integrate latent topic models with state-of-the-art methods and make the generated sentence vivacious. • Propose performance measurement for diversity.
  • 7. Related work • Image caption help visually impaired users and human-robot interaction. • State-of-the-art model are majorly attention-based models because they focus on correctness of description. • Compared with depicting images, giving comments is more challenging because it needs to not only understand images but take care of engagement with users. (Jonghwan Mun , AAAI 2017)
  • 8. Dataset - Netilook • Collect photos and comments from Lookbook to construct NetiLook.
  • 9. Method - Netizen Style Commenting • Some frequently used sentences along with posts (e.g., “love this!”, “nice”) which cause current models inclined to generate similar sentences.
  • 10. Method - Netizen Style Commenting (cont.) • Introduce style-weight wstyle element-wised multiplied (◦) with outputs at each step of LSTM to season generated sentences. • Style-weight wstyle represents the comment style, which teaches models to be acquainted with style in the corpus while generating captioning.
  • 11. Method - Netizen Style Commenting (cont.) • Abstract concepts are hard for people to give a specific definition. • Apply LDA to discover latent topics and fuse with current models. • LDA: • Topic-word vectors: • Comment-topic vectors: • N: word dictionary • z: topics • m: comments
  • 12. Method - Netizen Style Commenting (cont.) • To find the topic distribution in corpus, each comment votes the topic with highest probability by . • The voting gives the most characteristic style in the corpus: • The topic distribution of the corpus:
  • 13. Method - Netizen Style Commenting (cont.) • With the topic distribution of corpus y and topic-word vectors ϕ, our style-weight wstyle is now defined as: where yk means the k-th dimension of y
  • 14. Diversity measures • BLEU and METEOR are not for diversity measure, diversity measures are being put importance on sentence generation models. • More diverse sentences are generated, more unique words are used. • DicRate: ratio of unique words in ground truth and generations.
  • 15. Diversity measures (cont.) • WF-KL: The KL divergence of word frequency distribution. • Frequency distribution: • KL:
  • 16. Diversity measures (cont.) • POS-KL: The KL divergence of part-of-speech (POS) distribution. • Frequency distribution: • KL:
  • 17. Experiment • Setting: Beam size= 3; k= 3 or 5 • Topic models would not benefit the attention-based approach for the reason that attention-based models are greatly restricted the word selection.
  • 18. Experiment (cont.) • For a comment given by a human or machine, it is difficult to be evaluated on conventional measures such as BLEU in NetiLook. • Netilook has much more diversity and unique words than other datasets.
  • 19. Experiment (cont.) • There are some common words and general patterns to describe and comment on the clothing style in comparison with Flickr30k. • In NetiLook, the experiment in Table 3 shows that our method can greatly improve the diversity.
  • 21. Experiment (cont.) • User study: • about 25 year-old and familiar with netizen style community • 2.83 males/female
  • 22. Conclusion • Style-weight that greatly influences on current captioning models to immerse into human online society. • Proposed approaches benefit fashion photo commenting and improve image captioning task. • The approach could be applied on other fields to help generate sentences with various styles by the idea of style-weight.