Diving into Twitter data on consumer electronic brands
Which brands get tweeted about most? Is it mainly positive or negative?
15.3 GB of JSON data downloaded from Twitter’s Streaming API
between 13–25 May using Python
Before processing, tweets were in raw JSON format 
Time Created 
Tweet text/status 
Username 
Tweet location (if available) 
No. of followers 
No. of people followed 
No. of statuses 
Language 
Data should be optimized, as only a fraction of it is used for analysis; optimization improves model performance and saves cost and time
The same tweet we saw previously 
By optimizing the data, 
15.3 GB of JSON was converted to 757 MB of CSV (5% of original size)
After processing, only some fields were retained and converted to CSV format
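A minimal sketch of the JSON-to-CSV step, assuming one tweet object per line and the field names of the Twitter v1.1 tweet object (`created_at`, `text`, `lang`, and the nested `user` counts). The function names and column order are illustrative only; the scripts actually used are the ones in the GitHub repository listed at the end.

```python
import csv
import json

# Columns kept per tweet (the fields listed on the previous slide).
FIELDS = ["created_at", "text", "screen_name", "location",
          "followers_count", "friends_count", "statuses_count", "lang"]


def tweet_to_row(tweet):
    """Flatten one tweet dict into the handful of fields kept for analysis."""
    user = tweet.get("user") or {}
    return {
        "created_at": tweet.get("created_at", ""),
        # Collapse whitespace so each tweet stays on a single CSV line.
        "text": " ".join(tweet.get("text", "").split()),
        "screen_name": user.get("screen_name", ""),
        "location": user.get("location") or "",
        "followers_count": user.get("followers_count", 0),
        "friends_count": user.get("friends_count", 0),
        "statuses_count": user.get("statuses_count", 0),
        "lang": tweet.get("lang", ""),
    }


def json_lines_to_csv(lines, out):
    """Read one JSON tweet per line, write selected fields, return rows written."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    kept = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # blank keep-alive lines from the stream
        try:
            tweet = json.loads(line)
        except json.JSONDecodeError:
            continue  # truncated or corrupt record
        if not isinstance(tweet, dict) or "text" not in tweet:
            continue  # non-tweet messages such as delete notices
        writer.writerow(tweet_to_row(tweet))
        kept += 1
    return kept
```

Dropping everything except these columns is what shrinks the data; the raw tweet object carries many nested fields that the analysis never reads.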
Brand 
Positive Sentiment 
Brand 
Negative Sentiment 
Brand 
Mixed Sentiment 
The list of words for sentiment analysis is adapted from 
the Harvard General Inquirer dictionaries 
Source: http://www.wjh.harvard.edu/~inquirer/homecat.htm, downloaded on 28 May 2014 
Tweets are then tagged for brand and sentiment in R
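The tagging itself was done in R. As a rough illustration of the idea, here is a Python sketch that tags a tweet with the brands it mentions and a sentiment label. The three-word sentiment lists are placeholders (an assumption), standing in for the much longer lists adapted from the General Inquirer, and the function names are hypothetical.

```python
import re

# The seven brand keywords used for the main collection.
BRANDS = {"samsung", "sony", "nokia", "htc", "huawei", "blackberry", "motorola"}

# Placeholder word lists (assumption), not the lists used in the deck.
POSITIVE = {"love", "great", "good"}
NEGATIVE = {"hate", "bad", "broken"}


def tokenize(text):
    """Lower-case the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def tag_tweet(text):
    """Return (sorted list of brands mentioned, sentiment label)."""
    tokens = tokenize(text)
    brands = sorted({t for t in tokens if t in BRANDS})
    has_pos = any(t in POSITIVE for t in tokens)
    has_neg = any(t in NEGATIVE for t in tokens)
    if has_pos and has_neg:
        sentiment = "mixed"
    elif has_pos:
        sentiment = "positive"
    elif has_neg:
        sentiment = "negative"
    else:
        sentiment = "neutral"
    return brands, sentiment
```

Matching on whole tokens rather than on raw substrings is a deliberate choice in this sketch: it keeps a short sentiment word from firing inside a longer word.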
Initially, collected tweets based on 17 keywords 
Samsung 
S4 
Xperia 
HTC 
Huawei 
BlackBerry 
Apple 
S5 
Sony 
Nokia 
Note 3 
Lumia 
q5 
iPhone 
q10 
z10 
Motorola
“Apple” and “iPhone” accounted for 87% of tweet volume 
Removed from keywords during actual data collection to focus on other brands (and to save space and reduce bandwidth usage)
A trial was conducted with 16 keywords on 11 May, 8–9 am
1 GB of JSON data was collected in an hour
During a one-hour trial, “Apple” and “iPhone” had 87% share of tweets
Samsung 
Sony 
Nokia 
HTC 
Huawei 
BlackBerry 
Motorola 
Tweets containing seven keywords were collected from 13–25 May
4% of tweets mentioned more than one brand; they were excluded from analysis
8% of tweets had mixed sentiment (i.e., positive and negative sentiment); they were excluded from analysis 
92% of tweets remained, each only mentioning 1 brand with either “positive”, “negative”, or “neutral” sentiment 
3,681,942 tweets were collected 
After processing, 3,234,678 tweets remained for analysis
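The two counts on this slide imply that roughly 88% of collected tweets were kept overall. The sketch below reproduces that arithmetic and applies the exclusion rule (exactly one brand, no mixed sentiment) to a few hand-made, hypothetical tagged tweets; the function names are illustrative.

```python
# Counts as stated on the slide.
COLLECTED = 3_681_942
REMAINING = 3_234_678


def retained_share(collected, remaining):
    """Fraction of collected tweets that survived the exclusions."""
    return remaining / collected


def keep_for_analysis(brands, sentiment):
    """Exclusion rule: exactly one brand and no mixed sentiment."""
    return len(brands) == 1 and sentiment != "mixed"


# Hypothetical tagged tweets: (brands mentioned, sentiment label).
tagged = [
    (["samsung"], "positive"),
    (["sony", "nokia"], "neutral"),  # more than one brand: excluded
    (["htc"], "mixed"),              # mixed sentiment: excluded
    (["nokia"], "neutral"),
]
kept = [t for t in tagged if keep_for_analysis(*t)]

print(len(kept), "of", len(tagged), "sample tweets kept")
print(f"{retained_share(COLLECTED, REMAINING):.1%} of collected tweets remained")
```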
Samsung leads in Twitter buzz, followed by Sony and Nokia
Together, they make up 75% of Twitter buzz
Samsung is the clear leader in Twitter buzz, followed by Sony and Nokia
However, Samsung and Sony have wider product offerings than the rest, which mainly focus on phones
Also, Huawei’s users may mainly be on Weibo, Renren, etc.
Most brands have a roughly 1:1 ratio of positive to negative tweets
Samsung is the exception, with a ratio of roughly 3:2
Brands have a roughly equal ratio of positive to negative tweets
Dip due to connectivity issues 
Brands’ share of tweets is roughly consistent over time
Spikes in tweet volume coincide with product launches
Users who tweet about BlackBerry tend to be better connected (i.e., higher median of followers and people followed)* 
* Excluding outliers 
Across brands, there is not much difference in user connectedness 
The median user has around 250 followers and also follows 250 people
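A sketch of how a per-brand median with outliers excluded could be computed. The slides do not say how outliers were defined, so the 1.5 × IQR fence used here is an assumption, as are the function names and the sample values in the usage note.

```python
from statistics import median, quantiles


def drop_outliers(values):
    """Drop values outside the 1.5 * IQR fences (assumed outlier rule)."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if low <= v <= high]


def median_by_brand(rows):
    """rows: iterable of (brand, follower_count) pairs, one per tweet."""
    by_brand = {}
    for brand, followers in rows:
        by_brand.setdefault(brand, []).append(followers)
    return {brand: median(drop_outliers(vals))
            for brand, vals in by_brand.items()}
```

For example, `median_by_brand([("nokia", n) for n in (100, 150, 200, 250, 250, 300, 350, 400, 12833979)])` ignores the single very large account when taking the median. Each brand needs at least two values for `quantiles` to work.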
The 50th–75th percentile of users who tweet about Sony, HTC, and Motorola have very high numbers of all-time tweets (spam bots, perhaps?)*
While Nokia is 3rd in Twitter buzz share (14%), users who tweet about Nokia have the lowest numbers of all-time tweets
This suggests that the tweets are likely to come from real users and not bots (or perhaps from less active bots)
* Excluding outliers
However, there is a large difference between users’ all-time tweets
12,833,979 followers
11,796,709 followers
CNN’s tweet on Obama’s BlackBerry was “seen” by the most followers
1,753,696 tweets
1,730,006 tweets
A bot that retweets about farts has the highest number of all-time tweets
Initially, BlackBerry tweets showed 100% negative sentiment
The culprit was the word “lack” (contained in “BlackBerry”); it was removed from the word list
However, removing it reduced negative sentiment for other brands by 2–3%
An interesting error led to BlackBerry having 100% negative sentiment
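One plausible mechanism for this error is substring matching: “lack” appears inside “BlackBerry”, so every BlackBerry tweet would count as negative. The slides only name the word, so this is an assumption, and the R script in the repository is the place to check. The sketch below contrasts substring matching with whole-word matching on a hypothetical two-word negative list.

```python
import re

# Hypothetical excerpt of a negative word list.
NEGATIVE_WORDS = ["lack", "bad"]


def substring_hits(text, words):
    """Words found anywhere in the text, even inside longer words."""
    lowered = text.lower()
    return [w for w in words if w in lowered]


def whole_word_hits(text, words):
    """Words found only as standalone words, using regex word boundaries."""
    lowered = text.lower()
    return [w for w in words
            if re.search(r"\b" + re.escape(w) + r"\b", lowered)]


tweet = "My new BlackBerry arrived today"
print(substring_hits(tweet, NEGATIVE_WORDS))
print(whole_word_hits(tweet, NEGATIVE_WORDS))
```

Whole-word matching would keep “lack” in the list for tweets such as “a lack of apps” without penalising the brand name, which avoids the 2–3% drop in negative sentiment for the other brands.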
Track brands’ managed Twitter accounts and conversations to measure engagement
Which brands have better engagement with users and why? 
Track general message of tweets 
Are tweets of a brand mainly about sales, reviews, complaints, or news? 
Network analysis to identify users with high centrality and influence 
Which users have high influence and what are they tweeting about my brand? 
Geospatial analysis of tweets 
Are there differences in brand buzz, sentiment, and engagement across regions? 
Where do we go from here?
Code available on GitHub: https://github.com/eugeneyan/Twitter-SMA 
Python script to download tweets in JSON format 
Python scripts to convert tweets from JSON to CSV (with & without regular expressions filtering) 
R script and sentiment analysis list of words 
R script and sentiment analysis list of words to reproduce BlackBerry error

