SlideShare a Scribd company logo
1 of 25
Content Complexity, Similarity, and
Consistency in Social Media:
A Deep Learning Approach
Gene Moo Lee
University of Texas at Arlington
Joint work with
Donghyuk Shin (UT Austin/Amazon), Shu He (UConn),
Andrew B. Whinston (UT Austin)
DSI 2016, Austin TX
Social media: More users
2
Social media: More spending
3
Challenges and opportunities: 78% photos
4
Source: Chang et al. 2014
Research questions
• How can firms optimize social media strategies by
incorporating visual content?
• Specifically, what are the determinants of consumer
engagement in terms of “likes” and “reblogs” (sharing)
actions?
• How visual and textual contents play role?
• Operationally, how to construct measures on these
unstructured data sources?
5
Tumblr data
• Tumblr: microblogging platform (acquired by Yahoo!)
• 35,651 posts by 183 companies (May - Oct 2014)
• Automobile, Entertainment, Food, Fashion,
Finance, Leisure, Retail, Tech
• 89.7% photo & text, 6.3% pure text, 4% videos
• Collected “likes” and “reblogs” until Apr 2015
6
Company blogs in Tumblr
7
BMW USA Vogue IBM
Data: blog post and engagement
8
Post = Visual Info (Image) + Textual Info (Text, Tags)
Customer engagement = Notes (Likes + Reblogs)
Visual features
• Aesthetics (beautiful photos)
• Adult-contents
• Celebrity
• Feature complexity (low-level, flashy images)
• Semantic complexity (high-level, complex meaning)
• Number of salient objects
9
Feature complexity (low level)
• Visual complexity theory [Donderi 2006a, Pieters et al. 2010]
• Visually complex (flashy) images (colors, luminance,
shape) gets more attention
• This feature complexity can be captured by the
image’s compressed file size [Donderi 2006a; Donderi
2006b; Machado et al. 2015; Forsythe et al. 2011]
• However, this complexity can only capture low-level
complexity based on “pixel” values
10
Semantic complexity (high level)
• Recognition-By-Components theory [Biederman 1987]
• Human object recognition is invariant to feature
factors (colors, brightness, edges, positions, etc.)
• Vessel and Rubin (2010) show that visual preferences
are influenced by semantic content in the image
• We posit that semantic complexity matters!
• Operational question: How do we calculate semantics
from unstructured images?
11
Deep learning
• A branch of machine learning, inspired by human brain
• Algorithms to model high-level abstractions with multiple processing
layers of non-linear transformations
• (1) theoretical breakthroughs, (2) Big Data, (3) powerful computation
• Successfully applied in image/video/voice recognition, AlphaGo, etc.
12
Semantic complexity via deep learning
• Deep convolutional neural network (CNN) [Jia et al. 2014]
• Model trained with 1.2 million images with tags (ImageNet, Flickr)
• Tested on 53,417 images from brand-generated Tumblr posts
• Each image is represented by a 1,700 dimensional vector, where each
value is the confidence score w.r.t. an object (tag)
• We define semantic complexity as the Shannon Diversity Index (entropy)
on the 1,700-dimensional vector
• max = log(d), if p is uniformly distributed
• min = 0, if p_i = 1 for some i
13
ImageNet: Image DB with tree-structure tags
14
Source: ImageNet
More visual features
• 7th-layer output = robust representation of the image for “computer vision” tasks
• Aesthetic/beauty score [Dhar et al. 2011 (CVPR, Vision)]
• Adult-content score [Sengamedu et al. 2011 (MM, Vision)]
• Celebrity (450 celebrities) [Parhki et al. 2015 (BMV, Vision)]
• Number of salient objects [Zhang et al. 2015 (CVPR, Vision)]
15
Examples: Visual features
• Visual complexity theory (Attneave 1994,
Donderi 2006, Pieters et al. 2010)
• Visual stimuli are a composite of
colors,luminance, shape, number of
objects/patterns
16
Textual features
• Two textual sources: text and tags
• Length: # of words, # of tags
• Topic complexity: LDA topic model (text, tags)
• Order complexity: word2vec (for text only)
17
Examples: Textual features
• Topics
• Word clusters
18
Visual-Textual Content Similarity
• Image: pixels, Text/Tags: characters
— Need a common representation!
1. Represent each image as a collection of the predicted labels
obtained from deep learning — “image corpus”
2. Train LDA with both image and text/tags corpora — topic
distribution for images and text/tags
3. Cosine similarity between the two corresponding topic
distribution
19
Examples: Content similarity
• Topics
• Word clusters
20
21
Empirical Model
• Linear fixed effects model
• DV (likes/reblogs): take log transformation due to their
skewed distributions
• Capture blog (firm) heterogeneity
• Capture time effects (day of week, month)
• Other models
• Identical results with random effects
• Consistent results with negative binomial model
22
23
Summary and implications
1. Large-scale analysis on visual content in social
media
2. New visual semantic complexity via deep learning
• Able to relate visual and textual content
Visual content analysis can be used to optimize
content design for social media marketing
24
Thank you!
Contact Info: Gene Moo Lee
gene.lee@uta.edu

More Related Content

Similar to Content Complexity, Similarity, and Consistency in Social Media: A Deep Learning Approach

A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
Journal For Research
 
Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)
Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)
Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)
Oge Marques
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...
researchinventy
 
Research Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and ScienceResearch Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and Science
researchinventy
 
Techniques Used For Extracting Useful Information From Images
Techniques Used For Extracting Useful Information From ImagesTechniques Used For Extracting Useful Information From Images
Techniques Used For Extracting Useful Information From Images
Jill Crawford
 

Similar to Content Complexity, Similarity, and Consistency in Social Media: A Deep Learning Approach (20)

research paper
research paperresearch paper
research paper
 
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
A NOVEL WEB IMAGE RE-RANKING APPROACH BASED ON QUERY SPECIFIC SEMANTIC SIGNAT...
 
Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)
Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)
Oge Marques (FAU) - invited talk at WISMA 2010 (Barcelona, May 2010)
 
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHIMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
 
Leveraging social media for training object detectors
Leveraging social media for training object detectorsLeveraging social media for training object detectors
Leveraging social media for training object detectors
 
Rae
RaeRae
Rae
 
Introduction to OpenSemcq
Introduction to OpenSemcqIntroduction to OpenSemcq
Introduction to OpenSemcq
 
Towards Advanced Business Analytics using Text Mining and Deep Learning
Towards Advanced Business Analytics using Text Mining and Deep LearningTowards Advanced Business Analytics using Text Mining and Deep Learning
Towards Advanced Business Analytics using Text Mining and Deep Learning
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...
 
Research Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and ScienceResearch Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and Science
 
Techniques Used For Extracting Useful Information From Images
Techniques Used For Extracting Useful Information From ImagesTechniques Used For Extracting Useful Information From Images
Techniques Used For Extracting Useful Information From Images
 
Idm unit i ppt (deleted 38112ace3a82cbb8fba22044606fd8dc)
Idm  unit i ppt (deleted 38112ace3a82cbb8fba22044606fd8dc)Idm  unit i ppt (deleted 38112ace3a82cbb8fba22044606fd8dc)
Idm unit i ppt (deleted 38112ace3a82cbb8fba22044606fd8dc)
 
Twente ir-course 20-10-2010
Twente ir-course 20-10-2010Twente ir-course 20-10-2010
Twente ir-course 20-10-2010
 
Report
ReportReport
Report
 
[IJET-V2I2P5] Authors:Mr. Veer Karan Bharat1, Miss. Dethe Pratima Vilas2, Mis...
[IJET-V2I2P5] Authors:Mr. Veer Karan Bharat1, Miss. Dethe Pratima Vilas2, Mis...[IJET-V2I2P5] Authors:Mr. Veer Karan Bharat1, Miss. Dethe Pratima Vilas2, Mis...
[IJET-V2I2P5] Authors:Mr. Veer Karan Bharat1, Miss. Dethe Pratima Vilas2, Mis...
 
Seams2016 presentation calikli_et_al
Seams2016 presentation calikli_et_alSeams2016 presentation calikli_et_al
Seams2016 presentation calikli_et_al
 
Video + Language: Where Does Domain Knowledge Fit in?
Video + Language: Where Does Domain Knowledge Fit in?Video + Language: Where Does Domain Knowledge Fit in?
Video + Language: Where Does Domain Knowledge Fit in?
 
Video + Language: Where Does Domain Knowledge Fit in?
Video + Language: Where Does Domain Knowledge Fit in?Video + Language: Where Does Domain Knowledge Fit in?
Video + Language: Where Does Domain Knowledge Fit in?
 
benchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social mediabenchmarking image retrieval diversification techniques for social media
benchmarking image retrieval diversification techniques for social media
 
Benoit Visual Only Retrieval
Benoit Visual Only RetrievalBenoit Visual Only Retrieval
Benoit Visual Only Retrieval
 

More from Gene Moo Lee

Developing A Big Data Analytics Framework for Industry Intelligence
Developing A Big Data Analytics Framework for Industry IntelligenceDeveloping A Big Data Analytics Framework for Industry Intelligence
Developing A Big Data Analytics Framework for Industry Intelligence
Gene Moo Lee
 
Improving Sketch Reconstruction Accuracy
Improving Sketch Reconstruction AccuracyImproving Sketch Reconstruction Accuracy
Improving Sketch Reconstruction Accuracy
Gene Moo Lee
 
Improving the Interaction between Overlay Routing and Traffic Engineering
Improving the Interaction between Overlay Routing and Traffic EngineeringImproving the Interaction between Overlay Routing and Traffic Engineering
Improving the Interaction between Overlay Routing and Traffic Engineering
Gene Moo Lee
 
Modeling Human Mobility using Location Based Social Networks
Modeling Human Mobility using Location Based Social NetworksModeling Human Mobility using Location Based Social Networks
Modeling Human Mobility using Location Based Social Networks
Gene Moo Lee
 
Mobile Video Delivery via Human Movement
Mobile Video Delivery via Human MovementMobile Video Delivery via Human Movement
Mobile Video Delivery via Human Movement
Gene Moo Lee
 
Towards modeling M&A in high tech industries
Towards modeling M&A in high tech industriesTowards modeling M&A in high tech industries
Towards modeling M&A in high tech industries
Gene Moo Lee
 

More from Gene Moo Lee (13)

Developing A Big Data Analytics Framework for Industry Intelligence
Developing A Big Data Analytics Framework for Industry IntelligenceDeveloping A Big Data Analytics Framework for Industry Intelligence
Developing A Big Data Analytics Framework for Industry Intelligence
 
Big Data Analytics: Challenges and Opportunities
Big Data Analytics: Challenges and OpportunitiesBig Data Analytics: Challenges and Opportunities
Big Data Analytics: Challenges and Opportunities
 
Analyzing the spillover roles of user-generated reviews on purchases: Evidenc...
Analyzing the spillover roles of user-generated reviews on purchases: Evidenc...Analyzing the spillover roles of user-generated reviews on purchases: Evidenc...
Analyzing the spillover roles of user-generated reviews on purchases: Evidenc...
 
Towards a better measure of business proximity: Topic modeling for industry i...
Towards a better measure of business proximity: Topic modeling for industry i...Towards a better measure of business proximity: Topic modeling for industry i...
Towards a better measure of business proximity: Topic modeling for industry i...
 
Designing Cybersecurity Policies with Field Experiments
Designing Cybersecurity Policies with Field ExperimentsDesigning Cybersecurity Policies with Field Experiments
Designing Cybersecurity Policies with Field Experiments
 
Introduction to NP Completeness
Introduction to NP CompletenessIntroduction to NP Completeness
Introduction to NP Completeness
 
Strategic Network Formation in a Location-Based Social Network
Strategic Network Formation in a Location-Based Social NetworkStrategic Network Formation in a Location-Based Social Network
Strategic Network Formation in a Location-Based Social Network
 
Matching Mobile Applications for Cross Promotion
Matching Mobile Applications for Cross PromotionMatching Mobile Applications for Cross Promotion
Matching Mobile Applications for Cross Promotion
 
Improving Sketch Reconstruction Accuracy
Improving Sketch Reconstruction AccuracyImproving Sketch Reconstruction Accuracy
Improving Sketch Reconstruction Accuracy
 
Improving the Interaction between Overlay Routing and Traffic Engineering
Improving the Interaction between Overlay Routing and Traffic EngineeringImproving the Interaction between Overlay Routing and Traffic Engineering
Improving the Interaction between Overlay Routing and Traffic Engineering
 
Modeling Human Mobility using Location Based Social Networks
Modeling Human Mobility using Location Based Social NetworksModeling Human Mobility using Location Based Social Networks
Modeling Human Mobility using Location Based Social Networks
 
Mobile Video Delivery via Human Movement
Mobile Video Delivery via Human MovementMobile Video Delivery via Human Movement
Mobile Video Delivery via Human Movement
 
Towards modeling M&A in high tech industries
Towards modeling M&A in high tech industriesTowards modeling M&A in high tech industries
Towards modeling M&A in high tech industries
 

Recently uploaded

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Content Complexity, Similarity, and Consistency in Social Media: A Deep Learning Approach

  • 1. Content Complexity, Similarity, and Consistency in Social Media: A Deep Learning Approach Gene Moo Lee University of Texas at Arlington Joint work with Donghyuk Shin (UT Austin/Amazon), Shu He (UConn), Andrew B. Whinston (UT Austin) DSI 2016, Austin TX
  • 3. Social media: More spending 3
  • 4. Challenges and opportunities: 78% photos 4 Source: Chang et al. 2014
  • 5. Research questions • How can firms optimize social media strategies by incorporating visual content? • Specifically, what are the determinants of consumer engagement in terms of “likes” and “reblogs” (sharing) actions? • How visual and textual contents play role? • Operationally, how to construct measures on these unstructured data sources? 5
  • 6. Tumblr data • Tumblr: microblogging platform (acquired by Yahoo!) • 35,651 posts by 183 companies (May - Oct 2014) • Automobile, Entertainment, Food, Fashion, Finance, Leisure, Retail, Tech • 89.7% photo & text, 6.3% pure text, 4% videos • Collected “likes” and “reblogs” until Apr 2015 6
  • 7. Company blogs in Tumblr 7 BMW USA Vogue IBM
  • 8. Data: blog post and engagement 8 Post = Visual Info (Image) + Textual Info (Text, Tags) Customer engagement = Notes (Likes + Reblogs)
  • 9. Visual features • Aesthetics (beautiful photos) • Adult-contents • Celebrity • Feature complexity (low-level, flashy images) • Semantic complexity (high-level, complex meaning) • Number of salient objects 9
  • 10. Feature complexity (low level) • Visual complexity theory [Donderi 2006a, Pieters et al. 2010] • Visually complex (flashy) images (colors, luminance, shape) gets more attention • This feature complexity can be captured by the image’s compressed file size [Donderi 2006a; Donderi 2006b; Machado et al. 2015; Forsythe et al. 2011] • However, this complexity can only capture low-level complexity based on “pixel” values 10
  • 11. Semantic complexity (high level) • Recognition-By-Components theory [Biederman 1987] • Human object recognition is invariant to feature factors (colors, brightness, edges, positions, etc.) • Vessel and Rubin (2010) show that visual preferences are influenced by semantic content in the image • We posit that semantic complexity matters! • Operational question: How do we calculate semantics from unstructured images? 11
  • 12. Deep learning • A branch of machine learning, inspired by human brain • Algorithms to model high-level abstractions with multiple processing layers of non-linear transformations • (1) theoretical breakthroughs, (2) Big Data, (3) powerful computation • Successfully applied in image/video/voice recognition, AlphaGo, etc. 12
  • 13. Semantic complexity via deep learning • Deep convolutional neural network (CNN) [Jia et al. 2014] • Model trained with 1.2 million images with tags (ImageNet, Flickr) • Tested on 53,417 images from brand-generated Tumblr posts • Each image is represented by a 1,700 dimensional vector, where each value is the confidence score w.r.t. an object (tag) • We define semantic complexity as the Shannon Diversity Index (entropy) on the 1,700-dimensional vector • max = log(d), if p is uniformly distributed • min = 0, if p_i = 1 for some i 13
  • 14. ImageNet: Image DB with tree-structure tags 14 Source: ImageNet
  • 15. More visual features • 7th-layer output = robust representation of the image for “computer vision” tasks • Aesthetic/beauty score [Dhar et al. 2011 (CVPR, Vision)] • Adult-content score [Sengamedu et al. 2011 (MM, Vision)] • Celebrity (450 celebrities) [Parhki et al. 2015 (BMV, Vision)] • Number of salient objects [Zhang et al. 2015 (CVPR, Vision)] 15
  • 16. Examples: Visual features • Visual complexity theory (Attneave 1994, Donderi 2006, Pieters et al. 2010) • Visual stimuli are a composite of colors,luminance, shape, number of objects/patterns 16
  • 17. Textual features • Two textual sources: text and tags • Length: # of words, # of tags • Topic complexity: LDA topic model (text, tags) • Order complexity: word2vec (for text only) 17
  • 18. Examples: Textual features • Topics • Word clusters 18
  • 19. Visual-Textual Content Similarity • Image: pixels, Text/Tags: characters — Need a common representation! 1. Represent each image as a collection of the predicted labels obtained from deep learning — “image corpus” 2. Train LDA with both image and text/tags corpora — topic distribution for images and text/tags 3. Cosine similarity between the two corresponding topic distribution 19
  • 20. Examples: Content similarity • Topics • Word clusters 20
  • 21. 21
  • 22. Empirical Model • Linear fixed effects model • DV (likes/reblogs): take log transformation due to their skewed distributions • Capture blog (firm) heterogeneity • Capture time effects (day of week, month) • Other models • Identical results with random effects • Consistent results with negative binomial model 22
  • 23. 23
  • 24. Summary and implications 1. Large-scale analysis on visual content in social media 2. New visual semantic complexity via deep learning • Able to relate visual and textual content Visual content analysis can be used to optimize content design for social media marketing 24
  • 25. Thank you! Contact Info: Gene Moo Lee gene.lee@uta.edu

Editor's Notes

  1. Industry subsample analysis Long- and short-term customer engagement Categorize posts/blogs into ‘utilitarian’ vs ‘hedonic’ Examine non-linear effects