SlideShare a Scribd company logo
1 of 23
Sharing News Articles Using 140 Characters:
     A Propagation Analysis on Twitter
                         Dr. Sudha Ram
      McClelland Professor of MIS and Computer Science
                 Eller College of Management
                      University of Arizona
                        Tucson, AZ 86721
                Email: ram@eller.arizona.edu
                          April 2, 2013
     INSITE: Center for Business Intelligence and Analytics
                        www.insiteua.org

              This is joint work with Devi Bhattacharya

                                                   Friday, April 05, 2013
Motivation
Twitter – Important hub of Social Media
         140 million members1
         340 million tweets/day1
Evolution of Twitter as a serious
 newswire - Credible news appearing on
 it before anywhere else on the web2
         Iran election results controversy
         Egyptian revolution
  1 - as of March, 2012 via http://blog.twitter.com/2012/03/twitter-turns-six.html
  2 - http://blog.twitter.com/2007/09/twitter-for-news.html, http://blog.twitter.com/2008/07/twitter-as-news-wire.html,
  http://www.sysomos.com/insidetwitter/engagement/



Friday, April 05, 2013
Research Objective
  “Understanding and comparing the influence of news
  agency brands in a micro-blogging environment using
                   network analysis”
 Measuring the news article cascades caused by Twitter
  users’ participation
 Investigating Twitter news agency Followers’ involvement
  in the propagation process

                               Theory development for model
   Identifying and producing
                                  of online news article
    new network measures
                                       propagation


4/5/2013 4:34 AM
Our Initial Questions
 How do different news sources compare in
  volume and extent of spread of news articles?

 How do the articles from different news sources
  spread and survive over time and what is their
  lifespan?

 How do rates of spread compare across different
  news sources?


Friday, April 05, 2013
Data Collection
Dataset of tweets (21st Nov till 13th Dec,
 2011)
        Tweet containing valid URL of articles from
         selected news sources.
        Excludes tweets containing a URL that refers to
         the news source homepage
        URL Minimum reach 5 or more (tweets,re-
         tweeted/posted)
        6 Million tweets
Via Twitter Streaming API and Phirehose
 application
Friday, April 05, 2013
Twitter Propagation Network
 A weighted user-user network : G = (V, E, W)
                                       s
                              v1                 v2
  where,
    V : the set of nodes, representing the users on Twitter who
     tweet/retweet about a news media article
    E : the set of edges, signifying two users who are linked via
     tweet-retweet/reply relationship
    W : the normalized edge weight representing the strength
     of the tweet-retweet/reply relationship. (Strength s = Number
          of times v2 retweets/replies v1)
  This network is named as Twitter Activity Network (TAN)
 Captures the user participation in news propagation for the
  entire time period
4/5/2013 4:34 AM
Example of Twitter Activity Network



 NYTimes TAN for
 1 week activity
 #Nodes : 58,945
 #Edges : 29,676




4/5/2013 4:34 AM                   EGO
Example of TAN: WaPo


    Washington Post




 Non-media seeding node
 cascade streams

                          Disconnected network section – single user nodes
                          capturing tweets with no response
Friday, April 05, 2013
QUESTION 1

 How do different news sources
 compare in volume and extent of
 spread of news articles?
  Nodes volume and edge/node ratio
   comparison using TAN
  Ego network node analysis


Friday, April 05, 2013
Diffusion Network Statistics
    News Source     Media Seeding       Nodes      Edges     Diameter Edge/Node
                        Nodes                                              Ratio
BBC                bbcworld,           3801581 154276 341               0.406
                                               Higher number of        Lowest
                   bbcbreaking,                Bloomberg, NPR, Washington
                                                 long cascade         diameter         High
     NYTimes Vs. bbcnews                       Post, Forbes, and Wired : High edge/node
                                                    streams
Reuters            reuters           Sparse #
                                       61440     27035 non-media nodes 1
   Washington Post
                               retweets/replies –
                                                   % of 8               0.44
                                                                                       ratio
Guardian           guardiannews        21454     9250creating cascades
                                                             3          0.4312
                             High concentration of
NYTimes as information
   Acting          nytimes             1556503
                                individual tweets63436       143        0.4084
NPR boosters for news
       Financial each
                   nprnews             40454     16459       4          0.407
           other
          agencies
Washington Post washingtonpost 982474            40791 network's three major seedingLow
                                                      BBC 152           0.4153
                                                                                    edge/node
                                                      nodes – bbcworld, bbcbreaking and
FT                 ft                  10242     2925        4 long
                                                           Fewer bbcnews
                                                                        0.286         ratios
              Information spread
Forbes       concentrated at levels 88630
                   forbes                        28718       9
                                                            cascade     0.324
Bloomberg closerbloombergnews 26487
                    to the seeding               8719       streams.
                                                             104        0.33
Arstechnica           node
                   arstechnica         13313     5016        3          0.377
Mashable                 mashable   1971092    78620      8           0.398
Wired                    wired      43744      14786      9           0.338

                                     Superscript values indicate rank for that column
Friday, April 05, 2013
Example of Twitter Ego Network



 NYTimes Ego
 Network for 1
 week activity
 #Nodes : 22,176
 #Edges : 23,081

 Ego Network ⊆
 TAN




4/5/2013 4:34 AM
                                        TAN
Analyzing News Cascades – Diffusion Depth
                            Day-wise Comparison of Average Diffusion Depth
                   8      BBC News                                   Guardian –   M
                                                 NYTimes –
                   7       – 5.43                  4.00                2.14
                                                                                  Tu
 Diffusion Depth




                   6              Washington
                                            FT – 1.71
                                  Post – 4.00                                     W
                   5                                         Arstechnica
                                                                – 2.29            Th
                   4

                   3                                                              F

                   2                                                              Sa
                   1                                                              Su
                   0
                                                                                  Avg
                       Lowest Average                        Highest Average
                       Weekly Diffusion                      Weekly Diffusion
                           Depth                                 Depth
        4/5/2013 4:34 AM
QUESTION 2

  How do the articles from different
  news sources spread and survive over
  time and what is their lifespan?
  Analyzing Lifespan: Time difference
   between the last and first tweet posted
   containing the URL to an article
  Examining article survival distributions

Friday, April 05, 2013
Life Span of Articles on Twitter
                                 Shortest Lifespan
                                 Longest Lifespan


              NYTimes




           Bloomberg

                         Wired
            Forbes                                   Mashable      BBC




    • Y-Axis: Count of articles surviving
    • X-Axis: Time Progression from first tweet submission time (In Hours)

Friday, April 05, 2013
QUESTION 3

  How do rates of spread compare
  across different news sources analyzed
  in comparison with tweet posting
  activity?
   Concept of URL half-life
   Cumulative and non-cumulative trends

Friday, April 05, 2013
Cumulative Number of Tweets Posted
                           News Sources Exceeding                     Forbes
                                    BBC
                              Half-Life - Fair



 News Sources Exceeding Half-Life -
                                                    Mashable
          Most Popular



                                                         Washington     Wired
                 NYTimes         Reuters                    Post




 • Y-Axis: Cumulative number of tweets posted for a given article
 • X-Axis: Time Progression from first tweet submission time (In Hours)
 • Tweets Frequency restricted between 0-400 and Time Progression to 72 hours

Friday, April 05, 2013
Summary of Findings
•   BBC
     –    Maximum reach in terms of affected users, retweet levels and distribution of Twitter
          users in tweet-retweet ladders.
     –    Best article survival chances, with 0.1% of articles surviving for three days or more
     –    Median lifespan of article is 55 hours from first submission.
     –    Majorly supported by 2 other BBC Twitter accounts – bbcbreaking and bbcworld.
•   Reuters
     –    Average article lifespan and rate of spread
     –    Best edge/node ratio - indicating highest percentage of user interactions via retweets
          and replies
•   NYTimes and Mashable
     –    Similar tweets volume and rate of spread
     –    Mashable - high percentage of level 1 tweets and NYTimes - tweet-retweet cascades
          resulting in a diameter of 14 levels.
•   Guardian
     –    Most of the tweets in its ego network are concentrated in the first level
     –    Second highest edge/node ratio.
     –    Highest median number of tweets posted in the first hour
     –    Third highest rate of spread
                                                                                  Friday, April 05, 2013
Applying the network measures for news
          agency comparison
  Considering the collective influence of the network measures
   on news agency article propagation

                              Diffusion depth and
                              Diffusion magnitude
                                  appear to be
                                inversely related
                                    Follower engagement
                                        seems directly
                             News Agencies with
                          Involvementproportional to
                                        of followers
                            high tweeting activity
                           is more important magnitude
                                     Diffusion than
                          the number diffusion
                              have high of followers
                                     depth




 4/5/2013 4:34 AM
Implications
 Exhaustive analysis of news media agency related
  tweets
 Leads for developing diffusion strategy to maximize
  audience reach
        Magnitude Vs. Depth
 Identification of useful metrics to evaluate
  propagation
      Diffusion          Follower Participation,   Lifespan and
      Diameter           User Volume               related metrics
      Edge/Node Node composition at                Cumulative and
      Ratio, Reach different levels of ego         non-cumulative rate
                   network                         of spread

Friday, April 05, 2013
NPR Video
Click here to view the NPR news item on
our work:
http://www.insiteua.org/news/Twitter_Be
comes_News_Media_Tool.asp




4/5/2013                                  20
Publication Reference
For more information please see our
research paper:
Devipsita Bhattacharya and Sudha Ram, “Sharing
News Article Using 140 Characters: A Diffusion
Analysis on Twitter”, IEEE 2012 International
Conference on Advances in Social Network Analysis
and Mining (ASONAM 2012), pp, 966-971.
http://www.computer.org/csdl/proceeding
s/asonam/2012/4799/00/4799a966-
abs.html

4/5/2013                                            21
Harnessing the Power of Social Media Using
                 INSITE




   Facebook ID: BusinessIntelligenceAndAnalyticsCenter   Twitter ID: insite_ua

4/5/2013 4:34 AM
Questions




     INSITE: Center for Business Intelligence and Analytics
                    URL: www.insiteua.org

           Contact Information: ram@eller.arizona.edu


4/5/2013                                                      23

More Related Content

Similar to News Article Propagation Analysis on Twitter

Enhancing prediction of user stance for social networks rumors
Enhancing prediction of user stance for social networks rumorsEnhancing prediction of user stance for social networks rumors
Enhancing prediction of user stance for social networks rumorsIJECEIAES
 
Conducting Twitter Reserch
Conducting Twitter ReserchConducting Twitter Reserch
Conducting Twitter ReserchKim Holmberg
 
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...Steve Kramer
 
Rob Procter
Rob ProcterRob Procter
Rob ProcterNSMNSS
 
Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Serge Beckers
 
Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Serge Beckers
 
Secure and distributed data discovery and dissemination in wireless sensor ne...
Secure and distributed data discovery and dissemination in wireless sensor ne...Secure and distributed data discovery and dissemination in wireless sensor ne...
Secure and distributed data discovery and dissemination in wireless sensor ne...Pvrtechnologies Nellore
 
IEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTIONIEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTIONranjith kumar
 
IEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTIONIEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTIONranjith kumar
 

Similar to News Article Propagation Analysis on Twitter (9)

Enhancing prediction of user stance for social networks rumors
Enhancing prediction of user stance for social networks rumorsEnhancing prediction of user stance for social networks rumors
Enhancing prediction of user stance for social networks rumors
 
Conducting Twitter Reserch
Conducting Twitter ReserchConducting Twitter Reserch
Conducting Twitter Reserch
 
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
 
Rob Procter
Rob ProcterRob Procter
Rob Procter
 
Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?
 
Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?
 
Secure and distributed data discovery and dissemination in wireless sensor ne...
Secure and distributed data discovery and dissemination in wireless sensor ne...Secure and distributed data discovery and dissemination in wireless sensor ne...
Secure and distributed data discovery and dissemination in wireless sensor ne...
 
IEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTIONIEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE BE-BTECH NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
 
IEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTIONIEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
IEEE NS2 PROJECT@ DREAMWEB TECHNO SOLUTION
 

Recently uploaded

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 

Recently uploaded (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

News Article Propagation Analysis on Twitter

  • 1. Sharing News Articles Using 140 Characters: A Propagation Analysis on Twitter Dr. Sudha Ram McClelland Professor of MIS and Computer Science Eller College of Management University of Arizona Tucson, AZ 86721 Email: ram@eller.arizona.edu April 2, 2013 INSITE: Center for Business Intelligence and Analytics www.insiteua.org This is joint work with Devi Bhattacharya Friday, April 05, 2013
  • 2. Motivation Twitter – Important hub of Social Media  140 million members1  340 million tweets/day1 Evolution of Twitter as a serious newswire - Credible news appearing on it before anywhere else on the web2  Iran election results controversy  Egyptian revolution 1 - as of March, 2012 via http://blog.twitter.com/2012/03/twitter-turns-six.html 2 - http://blog.twitter.com/2007/09/twitter-for-news.html, http://blog.twitter.com/2008/07/twitter-as-news-wire.html, http://www.sysomos.com/insidetwitter/engagement/ Friday, April 05, 2013
  • 3. Research Objective “Understanding and comparing the influence of news agency brands in a micro-blogging environment using network analysis”  Measuring the news article cascades caused by Twitter users’ participation  Investigating Twitter news agency Followers’ involvement in the propagation process Theory development for model Identifying and producing of online news article new network measures propagation 4/5/2013 4:34 AM
  • 4. Our Initial Questions  How do different news sources compare in volume and extent of spread of news articles?  How do the articles from different news sources spread and survive over time and what is their lifespan?  How do rates of spread compare across different news sources? Friday, April 05, 2013
  • 5. Data Collection Dataset of tweets (21st Nov till 13th Dec, 2011)  Tweet containing valid URL of articles from selected news sources.  Excludes tweets containing a URL that refers to the news source homepage  URL Minimum reach 5 or more (tweets,re- tweeted/posted)  6 Million tweets Via Twitter Streaming API and Phirehose application Friday, April 05, 2013
  • 6. Twitter Propagation Network  A weighted user-user network : G = (V, E, W) s v1 v2 where,  V : the set of nodes, representing the users on Twitter who tweet/retweet about a news media article  E : the set of edges, signifying two users who are linked via tweet-retweet/reply relationship  W : the normalized edge weight representing the strength of the tweet-retweet/reply relationship. (Strength s = Number of times v2 retweets/replies v1) This network is named as Twitter Activity Network (TAN)  Captures the user participation in news propagation for the entire time period 4/5/2013 4:34 AM
  • 7. Example of Twitter Activity Network NYTimes TAN for 1 week activity #Nodes : 58,945 #Edges : 29,676 4/5/2013 4:34 AM EGO
  • 8. Example of TAN: WaPo Washington Post Non-media seeding node cascade streams Disconnected network section – single user nodes capturing tweets with no response Friday, April 05, 2013
  • 9. QUESTION 1 How do different news sources compare in volume and extent of spread of news articles?  Nodes volume and edge/node ratio comparison using TAN  Ego network node analysis Friday, April 05, 2013
  • 10. Diffusion Network Statistics News Source Media Seeding Nodes Edges Diameter Edge/Node Nodes Ratio BBC bbcworld, 3801581 154276 341 0.406 Higher number of Lowest bbcbreaking, Bloomberg, NPR, Washington long cascade diameter High NYTimes Vs. bbcnews Post, Forbes, and Wired : High edge/node streams Reuters reuters Sparse # 61440 27035 non-media nodes 1 Washington Post retweets/replies – % of 8 0.44 ratio Guardian guardiannews 21454 9250creating cascades 3 0.4312 High concentration of NYTimes as information Acting nytimes 1556503 individual tweets63436 143 0.4084 NPR boosters for news Financial each nprnews 40454 16459 4 0.407 other agencies Washington Post washingtonpost 982474 40791 network's three major seedingLow BBC 152 0.4153 edge/node nodes – bbcworld, bbcbreaking and FT ft 10242 2925 4 long Fewer bbcnews 0.286 ratios Information spread Forbes concentrated at levels 88630 forbes 28718 9 cascade 0.324 Bloomberg closerbloombergnews 26487 to the seeding 8719 streams. 104 0.33 Arstechnica node arstechnica 13313 5016 3 0.377 Mashable mashable 1971092 78620 8 0.398 Wired wired 43744 14786 9 0.338 Superscript values indicate rank for that column Friday, April 05, 2013
  • 11. Example of Twitter Ego Network NYTimes Ego Network for 1 week activity #Nodes : 22,176 #Edges : 23,081 Ego Network ⊆ TAN 4/5/2013 4:34 AM TAN
  • 12. Analyzing News Cascades – Diffusion Depth Day-wise Comparison of Average Diffusion Depth 8 BBC News Guardian – M NYTimes – 7 – 5.43 4.00 2.14 Tu Diffusion Depth 6 Washington FT – 1.71 Post – 4.00 W 5 Arstechnica – 2.29 Th 4 3 F 2 Sa 1 Su 0 Avg Lowest Average Highest Average Weekly Diffusion Weekly Diffusion Depth Depth 4/5/2013 4:34 AM
  • 13. QUESTION 2 How do the articles from different news sources spread and survive over time and what is their lifespan? Analyzing Lifespan: Time difference between the last and first tweet posted containing the URL to an article Examining article survival distributions Friday, April 05, 2013
  • 14. Life Span of Articles on Twitter Shortest Lifespan Longest Lifespan NYTimes Bloomberg Wired Forbes Mashable BBC • Y-Axis: Count of articles surviving • X-Axis: Time Progression from first tweet submission time (In Hours) Friday, April 05, 2013
  • 15. QUESTION 3 How do rates of spread compare across different news sources analyzed in comparison with tweet posting activity?  Concept of URL half-life  Cumulative and non-cumulative trends Friday, April 05, 2013
  • 16. Cumulative Number of Tweets Posted News Sources Exceeding Forbes BBC Half-Life - Fair News Sources Exceeding Half-Life - Mashable Most Popular Washington Wired NYTimes Reuters Post • Y-Axis: Cumulative number of tweets posted for a given article • X-Axis: Time Progression from first tweet submission time (In Hours) • Tweets Frequency restricted between 0-400 and Time Progression to 72 hours Friday, April 05, 2013
  • 17. Summary of Findings • BBC – Maximum reach in terms of affected users, retweet levels and distribution of Twitter users in tweet-retweet ladders. – Best article survival chances, with 0.1% of articles surviving for three days or more – Median lifespan of article is 55 hours from first submission. – Majorly supported by 2 other BBC Twitter accounts – bbcbreaking and bbcworld. • Reuters – Average article lifespan and rate of spread – Best edge/node ratio - indicating highest percentage of user interactions via retweets and replies • NYTimes and Mashable – Similar tweets volume and rate of spread – Mashable - high percentage of level 1 tweets and NYTimes - tweet-retweet cascades resulting in a diameter of 14 levels. • Guardian – Most of the tweets in its ego network are concentrated in the first level – Second highest edge/node ratio. – Highest median number of tweets posted in the first hour – Third highest rate of spread Friday, April 05, 2013
  • 18. Applying the network measures for news agency comparison  Considering the collective influence of the network measures on news agency article propagation Diffusion depth and Diffusion magnitude appear to be inversely related Follower engagement seems directly News Agencies with Involvementproportional to of followers high tweeting activity is more important magnitude Diffusion than the number diffusion have high of followers depth 4/5/2013 4:34 AM
  • 19. Implications  Exhaustive analysis of news media agency related tweets  Leads for developing diffusion strategy to maximize audience reach  Magnitude Vs. Depth  Identification of useful metrics to evaluate propagation Diffusion Follower Participation, Lifespan and Diameter User Volume related metrics Edge/Node Node composition at Cumulative and Ratio, Reach different levels of ego non-cumulative rate network of spread Friday, April 05, 2013
  • 20. NPR Video Click here to view the NPR news item on our work: http://www.insiteua.org/news/Twitter_Be comes_News_Media_Tool.asp 4/5/2013 20
  • 21. Publication Reference For more information please see our research paper: Devipsita Bhattacharya and Sudha Ram, “Sharing News Article Using 140 Characters: A Diffusion Analysis on Twitter”, IEEE 2012 International Conference on Advances in Social Network Analysis and Mining (ASONAM 2012), pp, 966-971. http://www.computer.org/csdl/proceeding s/asonam/2012/4799/00/4799a966- abs.html 4/5/2013 21
  • 22. Harnessing the Power of Social Media Using INSITE Facebook ID: BusinessIntelligenceAndAnalyticsCenter Twitter ID: insite_ua 4/5/2013 4:34 AM
  • 23. Questions INSITE: Center for Business Intelligence and Analytics URL: www.insiteua.org Contact Information: ram@eller.arizona.edu 4/5/2013 23