SlideShare a Scribd company logo
1 of 19
Determining online brand reputation with
machine learning from social media mentions: A
study in the banking context
Anette Rantanen1, Joni Salminen1,3, Filip Ginter2, Bernard J. Jansen3
1Turku School of Economics, 2University of Turku, 3Qatar Computing
Research Institute
Outline
1. Research purpose
2. Research design
3. Data collection
4. Training data annotation
5. Convolutional neural network
6. Results
7. Evaluation
8. Improvements
Research purpose
The purpose of this research is to develop and test a machine
learning model for automatic classification of online brand
reputation. Earlier machine learning efforts have mostly focused on
simple sentiment analysis, classifying positive, neutral and negative
brand mentions (e.g., Cambria, Schuller, Xia, & Havasi, 2013).
However, brand reputation is a much more complex construct,
including concepts, such as trust, quality, relatability, and other
psychologically advanced dimensions (Aaker, 1997).
Research design
To create a robust classification scheme, we first extensively review
the existing literature and then apply a convolutional neural network
(CNN), to detect and classify these brand dimensions from real
consumer discussions. We validate our approach by classifying social
media mentions for two organizations in the banking sector.
Data collection
• 2 large Finnish banks
• 2 large online social networks (> 10M messages)
Data collection
• 2 large Finnish banks
• 2 large online social networks (> 10M messages)
Data collection
• 2 large Finnish banks
• 2 large online social networks (> 10M messages)
Classification framework
Multidimensional reputation scales include Fortune Magazine’s annual
AMAC index, reputation quotient scale (RQ) by Fombrun, Gardberg, &
Sever (2013) and Walsh and Beatty's (2007) customer-based
reputation scale (CBR). The AMAC index focuses both on corporate
management’s point of view and customer perspective, while the
reputation quotient scale also takes into account other corporate
stakeholders such as employees. We adopt a perspective according to
which online brand reputation is a) a multi-dimensional construct, b)
defined to a large extent by customers. We combine constructs from
prior literature into a 6-dimensional classification scheme.
Classification framework
Training data annotation
In total, we retrieve 18,807 social media posts. Two research assistants
and one of the authors then code a 2,057 randomly sampled posts, so
that each post is classified either to one of the defined categories or to
a neutral category. To validate the manual coding, we calculate Fleiss’
Kappa and find a satisfactory agreement (k = 0.61). The manual coding
is a necessary step for generating training data for the supervised
machine learning model.
Convolutional Neural Network
Results
Evaluation
• Precision = Number of true
positives / Total number of
predicted positives
• Recall = Number of true
positives / Actual number
of positives
• F1 = Composite metric of
the above
Evaluation results
1/13 = 0.07
Improvements
• Simplification of classification  open coding
• Theoretically complex frameworks fit poorly into reality of people in
social media  what is an “innovative brand”?
• Additional training data  class balance and representativeness
• Trying out other machine learning models (e.g., random forest,
XGBoost)
Thank you!
• Dr. Joni Salminen, joolsa@utu.fi
Interpretation
As seen from Table 3, the strongest classes are neutral, agreeable(+) and
quality(+). Precision is strong in the class responsible(-), but the recall is
weaker, i.e. there was actually an even larger number of comments that
should have hit in this category, but they were categorized incorrectly into a
neutral category. For quality(+), the situation was the opposite. The machine
predicted quality(+) to have more comments than it actually did, so the
precision of the class was weaker than the recall. The quality(+) comments
seem to have been particularly mixed with agreeable(+) comments, which is
not surprising because there was some difficulty in distinguishing the two
even in the manual coding. The weakest classes are reliable(-), quality(-),
responsible(+), and innovative(+). There is only little training data for the
responsible(+) category, so reliable conclusions cannot be drawn.

More Related Content

What's hot

Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...Melissa Moody
 
IRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA TechniqueIRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA TechniqueIRJET Journal
 
Computing Ratings and Rankings by Mining Feedback Comments
Computing Ratings and Rankings by Mining Feedback CommentsComputing Ratings and Rankings by Mining Feedback Comments
Computing Ratings and Rankings by Mining Feedback CommentsIRJET Journal
 
A FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEME
A FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEMEA FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEME
A FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEMEaciijournal
 
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...IRJET Journal
 
Online brand Communities as Networks Engagement and Reach
Online brand Communities as Networks Engagement and ReachOnline brand Communities as Networks Engagement and Reach
Online brand Communities as Networks Engagement and ReachScott Sanders
 
IRJET- Analyzing Sentiments in One Go
IRJET-  	  Analyzing Sentiments in One GoIRJET-  	  Analyzing Sentiments in One Go
IRJET- Analyzing Sentiments in One GoIRJET Journal
 
Can you trust online ratings a mutual
Can you trust online ratings a mutualCan you trust online ratings a mutual
Can you trust online ratings a mutualAbhishek Kb
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET Journal
 
Framework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review DatasetFramework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review Datasetrahulmonikasharma
 
MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...
MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...
MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...IJECEIAES
 
Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...IOSR Journals
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...Absolutdata Analytics
 
Implicit vs. Explicit Trust in Social Matrix Factorization
Implicit vs. Explicit Trust in Social Matrix Factorization Implicit vs. Explicit Trust in Social Matrix Factorization
Implicit vs. Explicit Trust in Social Matrix Factorization Soudé Fazeli
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationAlejandro Bellogin
 
An Opinion Mining and Sentiment Analysis Techniques: A Survey
An Opinion Mining and Sentiment Analysis Techniques: A SurveyAn Opinion Mining and Sentiment Analysis Techniques: A Survey
An Opinion Mining and Sentiment Analysis Techniques: A SurveyIRJET Journal
 

What's hot (19)

Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...Automatic detection of online abuse and analysis of problematic users in wiki...
Automatic detection of online abuse and analysis of problematic users in wiki...
 
IRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA TechniqueIRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
 
Computing Ratings and Rankings by Mining Feedback Comments
Computing Ratings and Rankings by Mining Feedback CommentsComputing Ratings and Rankings by Mining Feedback Comments
Computing Ratings and Rankings by Mining Feedback Comments
 
A FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEME
A FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEMEA FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEME
A FRAMEWORK FOR SUMMARIZATION OF ONLINE OPINION USING WEIGHTING SCHEME
 
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
 
Online brand Communities as Networks Engagement and Reach
Online brand Communities as Networks Engagement and ReachOnline brand Communities as Networks Engagement and Reach
Online brand Communities as Networks Engagement and Reach
 
IRJET- Analyzing Sentiments in One Go
IRJET-  	  Analyzing Sentiments in One GoIRJET-  	  Analyzing Sentiments in One Go
IRJET- Analyzing Sentiments in One Go
 
Can you trust online ratings a mutual
Can you trust online ratings a mutualCan you trust online ratings a mutual
Can you trust online ratings a mutual
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
 
Framework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review DatasetFramework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review Dataset
 
MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...
MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...
MTVRep: A movie and TV show reputation system based on fine-grained sentiment ...
 
Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...Multidirectional Product Support System for Decision Making In Textile Indust...
Multidirectional Product Support System for Decision Making In Textile Indust...
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
K1802056469
K1802056469K1802056469
K1802056469
 
Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...
 
Implicit vs. Explicit Trust in Social Matrix Factorization
Implicit vs. Explicit Trust in Social Matrix Factorization Implicit vs. Explicit Trust in Social Matrix Factorization
Implicit vs. Explicit Trust in Social Matrix Factorization
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix Factorization
 
An Opinion Mining and Sentiment Analysis Techniques: A Survey
An Opinion Mining and Sentiment Analysis Techniques: A SurveyAn Opinion Mining and Sentiment Analysis Techniques: A Survey
An Opinion Mining and Sentiment Analysis Techniques: A Survey
 

Similar to Determining Online Brand Reputation with Machine Learning from Social Media Mentions: A Study in the Banking Context

IRJET - Sentiment Analysis and Rumour Detection in Online Product Reviews
IRJET -  	  Sentiment Analysis and Rumour Detection in Online Product ReviewsIRJET -  	  Sentiment Analysis and Rumour Detection in Online Product Reviews
IRJET - Sentiment Analysis and Rumour Detection in Online Product ReviewsIRJET Journal
 
Extracting Business Intelligence from Online Product Reviews
Extracting Business Intelligence from Online Product Reviews  Extracting Business Intelligence from Online Product Reviews
Extracting Business Intelligence from Online Product Reviews ijsc
 
An Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation GenerationAn Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation GenerationKayla Jones
 
A Survey Of Collaborative Filtering Techniques
A Survey Of Collaborative Filtering TechniquesA Survey Of Collaborative Filtering Techniques
A Survey Of Collaborative Filtering Techniquestengyue5i5j
 
Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15
Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15
Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15MLconf
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics DomainDrjabez
 
Mb0047 management information system
Mb0047   management information systemMb0047   management information system
Mb0047 management information systemsmumbahelp
 
Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...
Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...
Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...IRJET Journal
 
Customer_Analysis.docx
Customer_Analysis.docxCustomer_Analysis.docx
Customer_Analysis.docxKevalKabariya
 
ENGIN 36 Final Exam Take Home Exam May 15, 2020 .docx
ENGIN 36 Final Exam Take Home Exam May 15, 2020 .docxENGIN 36 Final Exam Take Home Exam May 15, 2020 .docx
ENGIN 36 Final Exam Take Home Exam May 15, 2020 .docxkhanpaulita
 
Buddi health class imbalance based deep learning
Buddi health   class imbalance based deep learningBuddi health   class imbalance based deep learning
Buddi health class imbalance based deep learningRam Swaminathan
 
FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...
FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...
FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...IJDKP
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsEditor IJCATR
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...IJTET Journal
 
Application of AI in customer relationship management
Application of AI in customer relationship managementApplication of AI in customer relationship management
Application of AI in customer relationship managementShashwat Shankar
 
B510519.pdf
B510519.pdfB510519.pdf
B510519.pdfaijbm
 
Online review mining for forecasting sales
Online review mining for forecasting salesOnline review mining for forecasting sales
Online review mining for forecasting saleseSAT Publishing House
 
Online review mining for forecasting sales
Online review mining for forecasting salesOnline review mining for forecasting sales
Online review mining for forecasting saleseSAT Journals
 

Similar to Determining Online Brand Reputation with Machine Learning from Social Media Mentions: A Study in the Banking Context (20)

IRJET - Sentiment Analysis and Rumour Detection in Online Product Reviews
IRJET -  	  Sentiment Analysis and Rumour Detection in Online Product ReviewsIRJET -  	  Sentiment Analysis and Rumour Detection in Online Product Reviews
IRJET - Sentiment Analysis and Rumour Detection in Online Product Reviews
 
Extracting Business Intelligence from Online Product Reviews
Extracting Business Intelligence from Online Product Reviews  Extracting Business Intelligence from Online Product Reviews
Extracting Business Intelligence from Online Product Reviews
 
An Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation GenerationAn Unsupervised Approach For Reputation Generation
An Unsupervised Approach For Reputation Generation
 
A Survey Of Collaborative Filtering Techniques
A Survey Of Collaborative Filtering TechniquesA Survey Of Collaborative Filtering Techniques
A Survey Of Collaborative Filtering Techniques
 
Sentiment analysis on unstructured review
Sentiment analysis on unstructured reviewSentiment analysis on unstructured review
Sentiment analysis on unstructured review
 
Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15
Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15
Mark Zangari, CEO, Quantellia at MLconf SEA - 5/01/15
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics Domain
 
Mb0047 management information system
Mb0047   management information systemMb0047   management information system
Mb0047 management information system
 
Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...
Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...
Twitter Text Sentiment Analysis: A Comparative Study on Unigram and Bigram Fe...
 
Customer_Analysis.docx
Customer_Analysis.docxCustomer_Analysis.docx
Customer_Analysis.docx
 
ENGIN 36 Final Exam Take Home Exam May 15, 2020 .docx
ENGIN 36 Final Exam Take Home Exam May 15, 2020 .docxENGIN 36 Final Exam Take Home Exam May 15, 2020 .docx
ENGIN 36 Final Exam Take Home Exam May 15, 2020 .docx
 
Buddi health class imbalance based deep learning
Buddi health   class imbalance based deep learningBuddi health   class imbalance based deep learning
Buddi health class imbalance based deep learning
 
FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...
FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...
FHCC: A SOFT HIERARCHICAL CLUSTERING APPROACH FOR COLLABORATIVE FILTERING REC...
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online Reviews
 
Report
ReportReport
Report
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
 
Application of AI in customer relationship management
Application of AI in customer relationship managementApplication of AI in customer relationship management
Application of AI in customer relationship management
 
B510519.pdf
B510519.pdfB510519.pdf
B510519.pdf
 
Online review mining for forecasting sales
Online review mining for forecasting salesOnline review mining for forecasting sales
Online review mining for forecasting sales
 
Online review mining for forecasting sales
Online review mining for forecasting salesOnline review mining for forecasting sales
Online review mining for forecasting sales
 

More from Joni Salminen

Automatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current ChallengesAutomatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current ChallengesJoni Salminen
 
Five NLP Challenges in Data-Driven Personas
Five NLP Challenges in Data-Driven PersonasFive NLP Challenges in Data-Driven Personas
Five NLP Challenges in Data-Driven PersonasJoni Salminen
 
Problem of majority voting
Problem of majority votingProblem of majority voting
Problem of majority votingJoni Salminen
 
Persona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road AheadPersona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road AheadJoni Salminen
 
Enriching social media personas with personality traits
Enriching social media personas with personality traitsEnriching social media personas with personality traits
Enriching social media personas with personality traitsJoni Salminen
 
User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?Joni Salminen
 
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...Joni Salminen
 
Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)Joni Salminen
 
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...Joni Salminen
 
Is More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona ProfilesIs More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona ProfilesJoni Salminen
 
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...Joni Salminen
 
OSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business ModellingOSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business ModellingJoni Salminen
 
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...Joni Salminen
 
Tips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic PersonasTips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic PersonasJoni Salminen
 
Big Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona GenerationBig Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona GenerationJoni Salminen
 
Why do startups avoid difficult problems?
Why do startups avoid difficult problems?Why do startups avoid difficult problems?
Why do startups avoid difficult problems?Joni Salminen
 
Social Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media PresenceSocial Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media PresenceJoni Salminen
 
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)Joni Salminen
 
Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)Joni Salminen
 
Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)Joni Salminen
 

More from Joni Salminen (20)

Automatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current ChallengesAutomatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current Challenges
 
Five NLP Challenges in Data-Driven Personas
Five NLP Challenges in Data-Driven PersonasFive NLP Challenges in Data-Driven Personas
Five NLP Challenges in Data-Driven Personas
 
Problem of majority voting
Problem of majority votingProblem of majority voting
Problem of majority voting
 
Persona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road AheadPersona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road Ahead
 
Enriching social media personas with personality traits
Enriching social media personas with personality traitsEnriching social media personas with personality traits
Enriching social media personas with personality traits
 
User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?
 
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
 
Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)
 
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
 
Is More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona ProfilesIs More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona Profiles
 
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
 
OSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business ModellingOSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business Modelling
 
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
 
Tips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic PersonasTips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic Personas
 
Big Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona GenerationBig Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona Generation
 
Why do startups avoid difficult problems?
Why do startups avoid difficult problems?Why do startups avoid difficult problems?
Why do startups avoid difficult problems?
 
Social Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media PresenceSocial Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media Presence
 
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
 
Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)
 
Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)
 

Recently uploaded

Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Doge Mining Website
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCRashishs7044
 
Entrepreneurship lessons in Philippines
Entrepreneurship lessons in  PhilippinesEntrepreneurship lessons in  Philippines
Entrepreneurship lessons in PhilippinesDavidSamuel525586
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationAnamaria Contreras
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
Financial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptxFinancial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptxsaniyaimamuddin
 
Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.Anamaria Contreras
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...ssuserf63bd7
 
1911 Gold Corporate Presentation Apr 2024.pdf
1911 Gold Corporate Presentation Apr 2024.pdf1911 Gold Corporate Presentation Apr 2024.pdf
1911 Gold Corporate Presentation Apr 2024.pdfShaun Heinrichs
 
Appkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxAppkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxappkodes
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Riya Pathan
 
Chapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal auditChapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal auditNhtLNguyn9
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdfKhaled Al Awadi
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckHajeJanKamps
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyotictsugar
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03DallasHaselhorst
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607dollysharma2066
 

Recently uploaded (20)

Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
 
Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR
 
Corporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information TechnologyCorporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information Technology
 
Entrepreneurship lessons in Philippines
Entrepreneurship lessons in  PhilippinesEntrepreneurship lessons in  Philippines
Entrepreneurship lessons in Philippines
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement Presentation
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
Financial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptxFinancial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptx
 
Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...
 
1911 Gold Corporate Presentation Apr 2024.pdf
1911 Gold Corporate Presentation Apr 2024.pdf1911 Gold Corporate Presentation Apr 2024.pdf
1911 Gold Corporate Presentation Apr 2024.pdf
 
Call Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North GoaCall Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North Goa
 
Appkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxAppkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptx
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737
 
Chapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal auditChapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal audit
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyot
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
 

Determining Online Brand Reputation with Machine Learning from Social Media Mentions: A Study in the Banking Context

  • 1. Determining online brand reputation with machine learning from social media mentions: A study in the banking context Anette Rantanen1, Joni Salminen1,3, Filip Ginter2, Bernard J. Jansen3 1Turku School of Economics, 2University of Turku, 3Qatar Computing Research Institute
  • 2. Outline 1. Research purpose 2. Research design 3. Data collection 4. Training data annotation 5. Convolutional neural network 6. Results 7. Evaluation 8. Improvements
  • 3. Research purpose The purpose of this research is to develop and test a machine learning model for automatic classification of online brand reputation. Earlier machine learning efforts have mostly focused on simple sentiment analysis, classifying positive, neutral and negative brand mentions (e.g., Cambria, Schuller, Xia, & Havasi, 2013). However, brand reputation is a much more complex construct, including concepts, such as trust, quality, relatability, and other psychologically advanced dimensions (Aaker, 1997).
  • 4. Research design To create a robust classification scheme, we first extensively review the existing literature and then apply a convolutional neural network (CNN), to detect and classify these brand dimensions from real consumer discussions. We validate our approach by classifying social media mentions for two organizations in the banking sector.
  • 5. Data collection • 2 large Finnish banks • 2 large online social networks (> 10M messages)
  • 6. Data collection • 2 large Finnish banks • 2 large online social networks (> 10M messages)
  • 7. Data collection • 2 large Finnish banks • 2 large online social networks (> 10M messages)
  • 8. Classification framework Multidimensional reputation scales include Fortune Magazine’s annual AMAC index, reputation quotient scale (RQ) by Fombrun, Gardberg, & Sever (2013) and Walsh and Beatty's (2007) customer-based reputation scale (CBR). The AMAC index focuses both on corporate management’s point of view and customer perspective, while the reputation quotient scale also takes into account other corporate stakeholders such as employees. We adopt a perspective according to which online brand reputation is a) a multi-dimensional construct, b) defined to a large extent by customers. We combine constructs from prior literature into a 6-dimensional classification scheme.
  • 10. Training data annotation In total, we retrieve 18,807 social media posts. Two research assistants and one of the authors then code a 2,057 randomly sampled posts, so that each post is classified either to one of the defined categories or to a neutral category. To validate the manual coding, we calculate Fleiss’ Kappa and find a satisfactory agreement (k = 0.61). The manual coding is a necessary step for generating training data for the supervised machine learning model.
  • 13.
  • 14.
  • 15. Evaluation • Precision = Number of true positives / Total number of predicted positives • Recall = Number of true positives / Actual number of positives • F1 = Composite metric of the above
  • 17. Improvements • Simplification of classification  open coding • Theoretically complex frameworks fit poorly into reality of people in social media  what is an “innovative brand”? • Additional training data  class balance and representativeness • Trying out other machine learning models (e.g., random forest, XGBoost)
  • 18. Thank you! • Dr. Joni Salminen, joolsa@utu.fi
  • 19. Interpretation As seen from Table 3, the strongest classes are neutral, agreeable(+) and quality(+). Precision is strong in the class responsible(-), but the recall is weaker, i.e. there was actually an even larger number of comments that should have hit in this category, but they were categorized incorrectly into a neutral category. For quality(+), the situation was the opposite. The machine predicted quality(+) to have more comments than it actually did, so the precision of the class was weaker than the recall. The quality(+) comments seem to have been particularly mixed with agreeable(+) comments, which is not surprising because there was some difficulty in distinguishing the two even in the manual coding. The weakest classes are reliable(-), quality(-), responsible(+), and innovative(+). There is only little training data for the responsible(+) category, so reliable conclusions cannot be drawn.