SlideShare a Scribd company logo
Analysis of Twitter Social Media
Data
Submitted By
ABHIGYAN PANDEY
DATA ANALYSIS
For the given social media data we have 5000 articles posted by 2272 unique users in the time period of 3
days over 7 social media providers and the media providers are-
• Aggregator
• Buy/Sell
• Forum Posts
• Forum Replies
• Generic Blogs
• Mainstream Media
• Twitter
MEDIA_PROVIDER
Frequency Percent Valid Percent
Cumulative
Percent
Valid Aggregator 57 1.1 1.1 1.1
Buy/Sell 4 .1 .1 1.2
Forum Posts 70 1.4 1.4 2.6
Forum Replies 345 6.9 6.9 9.5
Generic Blogs 22 .4 .4 10.0
Mainstream Media 101 2.0 2.0 12.0
TWITTER 4401 88.0 88.0 100.0
Total 5000 100.0 100.0
Analysis of Updates and Sentiments
Row Labels Sum of UPDATES
Aggregator 0
Buy/Sell 0
Forum Posts 0
Forum Replies 0
Generic Blogs 0
Mainstream Media 0
TWITTER 160000982
Grand Total 160000982
Descriptive Statistics
N Minimum Maximum Mean Std. Deviation
UPDATES 5000 0 1033694 32000.20 88128.117
Valid N (listwise) 5000
Interpretations-
• Only Twitter has received the updates.
• Mean Updates is 32000(approx.) and maximum updates received to an article are 1033694.
• The Article who has received maximum update has the content “Samsung, Alibaba to team up on mobile
payment systems: Reports: Samsung Electronics and… https://t.co/xeKWYboocy SPS®”
BLOG_POST_SENTIMENT
Frequency Percent Valid Percent Cumulative Percent
Valid Neutral 4999 100.0 100.0 100.0
Positive 1 .0 .0 100.0
Total 5000 100.0 100.0
Media Providers Sentiments Updates
Aggregator Neutral 0
Buy/Sell Neutral 0
Forum Posts Neutral 0
Forum Replies Neutral 0
Generic Blogs Neutral 0
Mainstream Media Neutral
Positive
0
Twitter Neutral 160000982
Grand Total 160000982
Interpretations-
• We have two types of sentiments positive and neutral for the articles.
• Out of 5000 articles, we have neutral sentiment for 4999 articles and positive sentiment for 1 article.
• The article who has received positive sentiment has content “integrate it in future gadgets. NFC has
featured prominently in some smartphones, and popular services such as Apple Pay, Samsung Pay and
Google Wallet -- helping the technology gain traction among retailers. Fitbit rival Jawbone has already
tied up with American Express Co to let users pay through its UP4...”
Analysis of Media Providers used by
Unique Users
MEDIA_PROVIDER
Frequency Percent Valid Percent Cumulative Percent
Valid Aggregator 12 .5 .5 .5
Buy/Sell 1 .0 .0 .6
Forum Posts 57 2.5 2.5 3.1
Forum Replies 315 13.9 13.9 16.9
Generic Blogs 15 .7 .7 17.6
Mainstream Media 4 .2 .2 17.8
TWITTER 1868 82.2 82.2 100.0
Total 2272 100.0 100.0
1% 0% 2%
14%
1% 0%
82%
Aggregator
Buy/Sell
Forum Posts
Forum Replies
Generic Blogs
Mainstream Media
TWITTER
Analysis of Following
Row Labels Sum of FOLLOWING
Aggregator 0
Buy/Sell 0
Forum Posts 0
Forum Replies 0
Generic Blogs 0
Mainstream Media 0
TWITTER 1278320
Grand Total 1278320
Interpretations-
• We have followings only for Twitter.
• Out of 1868 Unique Twitter users 1789 users have followings in the interval 0-2500, and 54 users have
followings in the interval 2500-5000.
Analysis of Followers
Row Labels Sum of FOLLOWERS
Aggregator 0
Buy/Sell 0
Forum Posts 0
Forum Replies 0
Generic Blogs 0
Mainstream Media 0
TWITTER 12637437
Grand Total 12637437
Interpretations-
• We have followers only for Twitter.
• Out of 1868 Unique Twitter users 1829 users have followings in the interval 0-20000.
The Chi Square Test
To test the independency between followers and following variable-
Our Hypotheses are-
H0 - Following and Followers variable are independent to each other.
H1 - Following and Followers variable are not independent to each other.
Test Statistic-
= Chi-Square test of Independence
= Observed value of two nominal variables
= Expected value of two nominal variables
Where,
= Expected Value
=sum of the ith column
= Sum of the kth column
N = total number
TEST RESULT
Chi-Square Tests
Value df
Asymptotic Significance (2-
sided)
Pearson Chi-Square 939802.589a 718263 .000
N of Valid Cases 1868
a. 719960 cells (100.0%) have expected count less than 5. The minimum expected count is .00.
Calculated = 939802.589
Tabulated
= 720235.6
Calculated > Tabulated
Here,Chi square calculated value is greater than the chi square tabulated at 5% level of
Significance.
Hence we reject the null hypothesis and accept the alternative hypothesis , that means
Follower and following variable are not independent to each other.
Correlation Approach to find the
association
Following Followers
Following 1 0.039
Followers 0.039 1
Interpretations-
• Karl pearson correlation coefficient is for the given dataset between followers and following variable is 0.039.
• Since value is slightly greater than zero hence we can say there must be an association between these two
variable.
** Here I have calculated correlation by considering unique cases as there are so many duplicate cases in these columns.
Correlation among following, followers
and updates
Following Followers Updates
Following 1
Followers 0.022 1
Updates 0.103 0.046 1
Interpretations of Correlation Coefficient-
• +1 Indicates perfect positive correlation between variables.
• 0 Indicates no correlation between variables.
• -1 Indicates perfectly negative correlation between variables.
** Here I have calculated correlation by considering all the cases as we have different update for each articles.

More Related Content

Similar to Twitter Case Study

Democratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDemocratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDatabricks
 
User Behavior Modeling on Financial Message Boards
User Behavior Modeling on Financial Message BoardsUser Behavior Modeling on Financial Message Boards
User Behavior Modeling on Financial Message Boardsprithan
 
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...Smarten Augmented Analytics
 
Analysing corporate performance with system dynamics
Analysing corporate performance with system dynamicsAnalysing corporate performance with system dynamics
Analysing corporate performance with system dynamicsDaniel Jarosch
 
Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...Giacomo Carozza
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPranov Mishra
 
Retail lessons learned from the first data driven business and future direct...
Retail  lessons learned from the first data driven business and future direct...Retail  lessons learned from the first data driven business and future direct...
Retail lessons learned from the first data driven business and future direct...TUSHAR GARG
 
Resolving Issues in the Energy Transition
Resolving Issues in the Energy TransitionResolving Issues in the Energy Transition
Resolving Issues in the Energy TransitionCrowdsourcing Week
 
Analysis of differential investor performance captstone presentation final
Analysis of differential investor  performance   captstone  presentation finalAnalysis of differential investor  performance   captstone  presentation final
Analysis of differential investor performance captstone presentation finalHoward Ho
 
[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies台灣資料科學年會
 
Reporting and KPIs
Reporting and KPIsReporting and KPIs
Reporting and KPIsZin Sett
 
Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8Sandy Donlon
 
Bazaarvoice Conversation Index 8
Bazaarvoice Conversation Index 8Bazaarvoice Conversation Index 8
Bazaarvoice Conversation Index 8Brett Hurt
 
The Conversation Index, volume 8
The Conversation Index, volume 8The Conversation Index, volume 8
The Conversation Index, volume 8Bazaarvoice
 
Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8Eric Frey, M.B.A.
 
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign ManagementT-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign ManagementVivastream
 

Similar to Twitter Case Study (20)

Democratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDemocratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn Creator
 
User Behavior Modeling on Financial Message Boards
User Behavior Modeling on Financial Message BoardsUser Behavior Modeling on Financial Message Boards
User Behavior Modeling on Financial Message Boards
 
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
 
Analysing corporate performance with system dynamics
Analysing corporate performance with system dynamicsAnalysing corporate performance with system dynamics
Analysing corporate performance with system dynamics
 
Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...Twitter, sentiment and finance: how qualitative information and markets are r...
Twitter, sentiment and finance: how qualitative information and markets are r...
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom Industry
 
Retail lessons learned from the first data driven business and future direct...
Retail  lessons learned from the first data driven business and future direct...Retail  lessons learned from the first data driven business and future direct...
Retail lessons learned from the first data driven business and future direct...
 
Brm (2) working
Brm (2)   workingBrm (2)   working
Brm (2) working
 
Social Media ROI
Social Media ROISocial Media ROI
Social Media ROI
 
Resolving Issues in the Energy Transition
Resolving Issues in the Energy TransitionResolving Issues in the Energy Transition
Resolving Issues in the Energy Transition
 
Analysis of differential investor performance captstone presentation final
Analysis of differential investor  performance   captstone  presentation finalAnalysis of differential investor  performance   captstone  presentation final
Analysis of differential investor performance captstone presentation final
 
[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies
 
Auto mobile 2 wheeler sector PPT by BuzzAngles
Auto mobile 2 wheeler sector PPT by BuzzAnglesAuto mobile 2 wheeler sector PPT by BuzzAngles
Auto mobile 2 wheeler sector PPT by BuzzAngles
 
Advanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITIAdvanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITI
 
Reporting and KPIs
Reporting and KPIsReporting and KPIs
Reporting and KPIs
 
Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8
 
Bazaarvoice Conversation Index 8
Bazaarvoice Conversation Index 8Bazaarvoice Conversation Index 8
Bazaarvoice Conversation Index 8
 
The Conversation Index, volume 8
The Conversation Index, volume 8The Conversation Index, volume 8
The Conversation Index, volume 8
 
Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8Bazaarvoice_Conversation_Index_Volume8
Bazaarvoice_Conversation_Index_Volume8
 
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign ManagementT-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
 

Recently uploaded

Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .NABLAS株式会社
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单enxupq
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单ewymefz
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sMAQIB18
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportSatyamNeelmani2
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxzahraomer517
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单yhkoc
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxbenishzehra469
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单vcaxypu
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationBoston Institute of Analytics
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样axoqas
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatheahmadsaood
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单ewymefz
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单ewymefz
 

Recently uploaded (20)

Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxx
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 

Twitter Case Study

  • 1. Analysis of Twitter Social Media Data Submitted By ABHIGYAN PANDEY
  • 2. DATA ANALYSIS For the given social media data we have 5000 articles posted by 2272 unique users in the time period of 3 days over 7 social media providers and the media providers are- • Aggregator • Buy/Sell • Forum Posts • Forum Replies • Generic Blogs • Mainstream Media • Twitter MEDIA_PROVIDER Frequency Percent Valid Percent Cumulative Percent Valid Aggregator 57 1.1 1.1 1.1 Buy/Sell 4 .1 .1 1.2 Forum Posts 70 1.4 1.4 2.6 Forum Replies 345 6.9 6.9 9.5 Generic Blogs 22 .4 .4 10.0 Mainstream Media 101 2.0 2.0 12.0 TWITTER 4401 88.0 88.0 100.0 Total 5000 100.0 100.0
  • 3. Analysis of Updates and Sentiments Row Labels Sum of UPDATES Aggregator 0 Buy/Sell 0 Forum Posts 0 Forum Replies 0 Generic Blogs 0 Mainstream Media 0 TWITTER 160000982 Grand Total 160000982 Descriptive Statistics N Minimum Maximum Mean Std. Deviation UPDATES 5000 0 1033694 32000.20 88128.117 Valid N (listwise) 5000 Interpretations- • Only Twitter has received the updates. • Mean Updates is 32000(approx.) and maximum updates received to an article are 1033694. • The Article who has received maximum update has the content “Samsung, Alibaba to team up on mobile payment systems: Reports: Samsung Electronics and… https://t.co/xeKWYboocy SPS®”
  • 4. BLOG_POST_SENTIMENT Frequency Percent Valid Percent Cumulative Percent Valid Neutral 4999 100.0 100.0 100.0 Positive 1 .0 .0 100.0 Total 5000 100.0 100.0 Media Providers Sentiments Updates Aggregator Neutral 0 Buy/Sell Neutral 0 Forum Posts Neutral 0 Forum Replies Neutral 0 Generic Blogs Neutral 0 Mainstream Media Neutral Positive 0 Twitter Neutral 160000982 Grand Total 160000982 Interpretations- • We have two types of sentiments positive and neutral for the articles. • Out of 5000 articles, we have neutral sentiment for 4999 articles and positive sentiment for 1 article. • The article who has received positive sentiment has content “integrate it in future gadgets. NFC has featured prominently in some smartphones, and popular services such as Apple Pay, Samsung Pay and Google Wallet -- helping the technology gain traction among retailers. Fitbit rival Jawbone has already tied up with American Express Co to let users pay through its UP4...”
  • 5. Analysis of Media Providers used by Unique Users MEDIA_PROVIDER Frequency Percent Valid Percent Cumulative Percent Valid Aggregator 12 .5 .5 .5 Buy/Sell 1 .0 .0 .6 Forum Posts 57 2.5 2.5 3.1 Forum Replies 315 13.9 13.9 16.9 Generic Blogs 15 .7 .7 17.6 Mainstream Media 4 .2 .2 17.8 TWITTER 1868 82.2 82.2 100.0 Total 2272 100.0 100.0 1% 0% 2% 14% 1% 0% 82% Aggregator Buy/Sell Forum Posts Forum Replies Generic Blogs Mainstream Media TWITTER
  • 6. Analysis of Following Row Labels Sum of FOLLOWING Aggregator 0 Buy/Sell 0 Forum Posts 0 Forum Replies 0 Generic Blogs 0 Mainstream Media 0 TWITTER 1278320 Grand Total 1278320 Interpretations- • We have followings only for Twitter. • Out of 1868 Unique Twitter users 1789 users have followings in the interval 0-2500, and 54 users have followings in the interval 2500-5000.
  • 7. Analysis of Followers Row Labels Sum of FOLLOWERS Aggregator 0 Buy/Sell 0 Forum Posts 0 Forum Replies 0 Generic Blogs 0 Mainstream Media 0 TWITTER 12637437 Grand Total 12637437 Interpretations- • We have followers only for Twitter. • Out of 1868 Unique Twitter users 1829 users have followings in the interval 0-20000.
  • 8. The Chi Square Test To test the independency between followers and following variable- Our Hypotheses are- H0 - Following and Followers variable are independent to each other. H1 - Following and Followers variable are not independent to each other. Test Statistic- = Chi-Square test of Independence = Observed value of two nominal variables = Expected value of two nominal variables
  • 9. Where, = Expected Value =sum of the ith column = Sum of the kth column N = total number
  • 10. TEST RESULT Chi-Square Tests Value df Asymptotic Significance (2- sided) Pearson Chi-Square 939802.589a 718263 .000 N of Valid Cases 1868 a. 719960 cells (100.0%) have expected count less than 5. The minimum expected count is .00. Calculated = 939802.589 Tabulated = 720235.6 Calculated > Tabulated Here,Chi square calculated value is greater than the chi square tabulated at 5% level of Significance. Hence we reject the null hypothesis and accept the alternative hypothesis , that means Follower and following variable are not independent to each other.
  • 11. Correlation Approach to find the association Following Followers Following 1 0.039 Followers 0.039 1 Interpretations- • Karl pearson correlation coefficient is for the given dataset between followers and following variable is 0.039. • Since value is slightly greater than zero hence we can say there must be an association between these two variable. ** Here I have calculated correlation by considering unique cases as there are so many duplicate cases in these columns.
  • 12. Correlation among following, followers and updates Following Followers Updates Following 1 Followers 0.022 1 Updates 0.103 0.046 1 Interpretations of Correlation Coefficient- • +1 Indicates perfect positive correlation between variables. • 0 Indicates no correlation between variables. • -1 Indicates perfectly negative correlation between variables. ** Here I have calculated correlation by considering all the cases as we have different update for each articles.